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Executive  Summary 

The  studies  reported  were  carried  out  during  the  period  September  1991  through  August 
1993  under  the  auspices  of  the  Directorate  of  Aerospace  Sciences  of  the  AFOSR.  They  were 
concerned  with  the  dynamics  and  control  of  spaced  based  articulated  structures,  and  have  led 
to  a  better  understanding  of  the  theory  and  procedures  for  the  non-causal  inversion,  slewing 
and  control  of  systems  characterized  by  unstable  zero-dynamics.  In  particular  the  following 
results  can  be  highlighted; 

•  general  non-recursive  procedures  to  solve  the  inverse  dynamics  and  kinematics  of 
flexible  open-chain  and  closed-ciiain  nonlinear  articulated  structures  in  two  and  three 
dimensional  settings.  These  include  robust  multibody  dynamic  formulations  that  are 
efficient  and  stable  even  in  singular  configurations  and  in  the  presence  of  redundant 
algebraic  constraints. 

•  a  general  theory  for  non-causal  inversion  of  nonlinear  non-minimum  phase  systems 
which  relates  invertibility  and  hyperbolicity  of  zero  dynamics  equilibria. 

•  recursive  algorithms  in  three  dimensions  for  inverse  dynamics  that  include  lumped  and 
distributed  actuators. 

•  new  methods  for  the  sizing  and  the  placement  of  distributed  actuators  on  linear  and 
nonlinear  configurations. 

•  a  global  nonlinear  operator  framework  for  controlled  dynamical  systems  which 
presently  handles  local  dynamics. 

•  a  new  approach  to  nonlinear  tracking  control  which  does  not  require  solution  of  the 
Bymes-Isidori  PDE.  New  inverse  dynamics  based  control  methods  have  broad 
applications  (e.g.  nonlinear  flight  control,  vibration  control  of  nonlinear  structures, 
trajectory  control  of  towed  vehicles,  etc.) 

•  experimental  results  obtained  fix)m  a  smart  articulated  structure  experiment  (see  figure 
2.1)  (funded  by  Astro-Aerospace  Corporation)  designed  and  fabricated  at  UC^B. 
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1.  General  Procedures  for  the  Non*Causal  Inverse  Dynamics  of 
Nonlinear  Articulated  Structures. 

Summary 

This  study  addresses  the  problem  of  end-point  trajectory  tracking  in  nonlinear  flexible 
aniculated  structures  through  the  use  of  inverse  dynamics,  and  summarizes  the  work 
accomplished  under  the  contract  and  which  is  described  in  detail  in  references  6,  7,  and  8.  A 
global  Lagrangian  approach  is  employed  in  formulating  the  system  equations  of  motion,  and  an 
iterative  procedure  is  proposed  to  achieve  end-point  trajectory  tracking  in  three-dimensional, 
flexible  multibody  systems.  Each  iteration  involves  firstly,  an  inverse  kinematics  procedure 
wherein  elastic  displacements  are  determined  in  terms  of  the  rigid  body  coonlinates  and  Lagrange 
multipliers,  secondly,  an  explicit  computation  of  the  inverse  dynamic  joint  actuation,  and  thirdly, 
a  forward  dynamic  analysis  wherein  generalized  coordinates  and  Lagrange  multipliers  are 
determined  in  terms  of  the  joint  actuation  and  desired  end-point  coordinates.  In  contrast  with  the 
recursive  methods  previously  proposed,  this  new  method  is  the  most  general  since  it  is  suitable 
for  both  open-chain  and  closed-chain  configurations  of  three-dimensional  multibody  systems. 
The  algorithm  yields  stable,  non-causal  actuating  joint  torques  and  associated  Lagrange 
multipliers  that  account  for  the  constraint  forces  between  flexible  multibody  components. 

Introduction 

The  problem  of  end-point  trajectory  tracking  in  flexible  multibody  systems  has  led  to  the 
development  of  methods  for  inverse  dynamics.  Inverse  dynamics  deals  with  the  problem  of 
determining  the  joint  actuation  that  will  cause  a  specified  control  point  in  the  flexible  multibody 
system  to  follow  a  desired  trajectory.  T..,s  pioneering  work  of  Reference  1  on  the  trajectory 
control  of  a  single  flexible  link  through  inverse  dynamics  showed  that  the  inverse  dynamic  torque 
is  non-causal  with  respect  to  the  end-point  motion,  i.e.,  actuation  is  required  before  the  end-point 
has  started  to  move  as  well  as  after  the  end-point  has  stopped.  Moulin  and  Bayo  [2] 
demonstrated  that  because  of  the  non-minimum  phase  character  of  the  inverse  dynamics  for  the 
trajectory  tracking  problem,  the  only  bounded  solution  for  the  inverse  dynamic  torque  has  to  be 
non-causal.  Bayo,  et.  al.  [3],  extended  the  inverse  dynamics  to  planar,  multiple-link  systems 
using  an  iterative  frequency  domain  approach.  The  recursive  method  proposed  in  that  study  is 
suitable  for  planar  open-chain  systems,  but  required  an  ad  hoc  procedure  for  planar  closed-chain 
systems.  A  time  domain  inverse  dynamic  technique  based  on  the  non-causal  impulse  response 
function  was  presented  by  Bayo  and  Moulin  [4]  for  the  single  link  system,  with  provisions  for 
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extension  to  multiple  link  systems.  An  equivalent  time  domain  approach  for  a  single  link  arm 
was  proposed  by  Kwon  and  Book  [5]  where  the  non-causality  of  the  computed  torque  was 
captured  by  dividing  the  inverse  system  into  causal  and  anticausal  parts. 

In  this  study,  a  general  approach  is  presented  for  the  solution  of  the  non-causal  inverse 
dynamics  of  three-dimensional,  flexible  multibody  systems,  which  is  suitable  for  both  open- 
chain  and  closed-chain  configurations.  With  this  work,  a  methodology  is  presented  that  is 
suitable  for  all  multibody  systems,  ranging  from  the  single  link  case  to  three-dimensional 
systems  with  general  topologies. 

Problem  Formulation 

Consider  an  n-body  flexible  multibody  system  such  as  that  shown  in  Fig,  1.1.  A  typical 
multibody  component,  say  body  /,  is  shown  in  Fig.  1.1  along  with  the  floating  reference  frame 
associated  with  that  body.  The  generalized  coordinates  consist  of  rigid  body  coordinates  qj. 
which  describe  the  position  and  orientation  of  the  floating  reference  frame  associated  with  each 
multibody  component,  and  deformation  coordinates  which  describe  the  deformation  of  the 

flexible  body  with  respect  to  its  floating  reference  frame.  The  rigid  body  coordinates  q{.  consist 
of  the  Cartesian  coordinates  R‘  which  describe  the  position  of  the  origin  of  the  floating  reference 
frame  associated  with  body  /,  and  a  set  of  Euler  parameters  O'  which  describe  the  orientation  of 
the  floating  frame.  The  deformation  from  the  nominal  configuration  is  assumed  to  be  small,  so 
that  the  different  bending  and  torsional  modes  are  decoupled. 

Considering  the  reference  coordinates  q^  =  0^,  qj-  as  generalized  coordinates  for  the 

flexible  multibody  system,  these  coordinates  are  not  independent  because  the  motion  of  specific 
points  in  different  bodies  are  related  according  to  the  type  of  mechanical  joints  that  interconnect 
them.  Moreover,  in  flexible  multibody  systems,  the  deformation  of  a  component  affects  the 
configuration  of  adjacent  components.  As  a  consequence,  the  interdependence  of  the  generalized 
coordinates  is  expressed  by  a  vector  of  kinematic  constraint  equations,  such  as 

O(q.0  =  0  (1.1) 

where  q  is  the  total  vector  of  system  generalized  coordinates,  t  is  time,  and  <D  is  the  vector  of 
linearly  independent  holonomic  constraint  equations.  These  constraint  equations  can  be  further 
classified  into; 

1 .  rigid  body  constraints  where  only  rigid  body  variables  are  involved  in  the  constraint 
equation; 


Fig.  1.1.  Topological  description  of  a  flexible  articulated  structure. 
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2.  joint  constraints  where  both  rigid  body  and  deformation  coordinates  are  included  in  the 
constraint  equation;  and 

3 .  rheonomic  consuuints  wherein  the  constraint  equations  can  be  explicit  functions  of  rime  as 
well  as  generalized  coordinates. 

The  third  type  of  constraint  becomes  active,  for  example,  in  the  case  of  imposing  the 
coordinates  of  the  end-effector  to  follow  a  desired  trajectory. 

Considering  the  rigid  body  and  deformation  coordinates  described  above  as  generalized 
coordinates,  and  following  standard  procedures  in  multibody  dynamics,  the  constrained 
equations  of  motion  become 

M(q)q-t-Cq-HKq-i-<DjX  =  Qe-t-Qv(q,q)  (1.2) 

where  M,  C,  and  K  are  the  system  mass,  damping  and  stiffness  matrices,  respectively,  X  is  the 
vector  of  Lagrange  multipliers  associated  with  the  constraints,  is  the  constraint  Jacobian 
matrix,  is  the  vector  of  applied  external  forces,  and  Qy  is  the  quadratic  velocity  vector.  The 
quadratic  velocity  vector  contains  the  centrifugal  forces  and  Coriolis  forces  that  result  from  the 
differentiation  of  the  kinetic  energy  expression  with  respect  to  the  generalized  coordinates. 
Geometric  stiffening  due  to  high  rotation  rates  can  also  be  added  to  the  vector  Qy , 

In  a  forward  dynamic  analysis,  i.e.,  finding  the  resulting  motion  given  the  applied  joint 
forces  and  external  forces,  Eqs.  (1.1)  and  (1.2)  constitute  a  mixed  system  of  differential- 
algebraic  equations  that  have  to  be  integrated  simultaneously.  The  solution  to  the  inverse 
dynamics  problem  requires  a  forward  dynamic  analysis  within  an  iteration  process.  We  solve  the 
forward  dynamics  problem  by  using  the  augmented  Lagrangian  penalty  formulation  [8-9]. 
Applying  the  augmented  Lagrangian  penalty  formulation  to  Eqs.  (1.1)  and  (1.2)  results  in  the 
following  equation: 

M(q)q-i-Cq-t-Kq-Kl>Ja[o-h2na)<i;-ha)^<D]  =  Qe-i-Qy(q,  q)-<I>JX.*  (1.3) 

where  a  is  a  diagonal  matrix  of  penalty  factors  whose  elements  arc  large  real  numbers  that  will 
assure  the  satisfaction  of  constraints,  co  and  p.  are  diagonal  matrices  representing  the  natural 
frequencies  and  damping  characteristics  of  the  dynamic  penalty  system  associated  with  the 
constraints.  The  augmented  Lagrangian  method  requires  an  iteration  for  the  correct  value  of  the 
Lagrange  multipliers.  The  iterative  equation  for  the  Lagrange  multipliers  is  given  by 

^1+1  =  +  2  p  to  6  -f-  0)^  <1)|. 


(1.4) 
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The  augmented  Lagrangian  penalty  formulation  [8]  has  several  advantages  over  the 
standard  algorithms  used  in  solving  differential-algebraic  equations.  First,  the  method  obviates 
the  need  to  solve  a  mixed  set  of  differential-algebraic  equations  and  aoes  not  increase  the  numN*“ 
of  equations  to  account  for  the  constramts.  Second,  this  method  allows  the  use  of  standard 
unconditionally  stable  algorithms  without  the  need  of  further  stabilization  techniques  to  control 
the  violation  of  constraints  during  the  integration  process.  Third,  the  method  can  handle 
redundant  constraints  and  allows  the  multibody  system  to  undergo  singular  positions.  Fourth, 
the  constraint  forces  (Lagrange  multipliers)  can  be  obtained  as  a  by-product  of  the  uiteftration 
without  having  to  integrate  additional  equations  for  them.  Finally,  the  method  assures 
convergence  independent  of  the  penalty  values  used. 

Inverse  Kinematics  and  Inverse  Dynamics 

The  three-dimensional  inverse  dynamics  problem  for  either  open-chain  or  closed-chain 
topologies  is  solved  by  an  iterative  Lagrangian  procediu^.  Our  overall  strategy  is  to  first  solve 
the  inverse  kinematics  problem,  i.e.,  finding  the  unknown  rigid  body  coordinates  qr  and  flexible 
body  displacements  qy,  given  the  desired  end-point  coordinates  as  explicit  functions  of  time. 
Having  determined  the  correct  generalized  coordinates  and  their  time  derivatives,  the  inverse 
dyna’  .  joint  torques  can  be  obtained  explicitly  from  the  equations  of  motion.  Compared  to  the 
recursive  procedures  previously  proposed,  this  new  approach  is  more  systematic  and  becomes 
the  only  choice  when  closed-chain  systems  are  encountered.  The  elastic  links  are  modeled  under 
pinned-pinned  boundary  conditions.  Furthermore,  since  torsional  deformations  cause  deviations 
from  the  nominal  configuration  further  down  the  chain,  we  model  the  elastic  link  as  fixed  with 
respect  to  torsion  at  the  distal  end  of  the  link. 

Our  goal  then  is  to  formulate  an  inverse  kinematics  equation  that  is  linearized  about  the 
nominal  motion,  so  that  the  elastic  displacements,  which  are  non-causal  with  respect  to  the  end¬ 
point  motion,  can  be  determined  through  a  transformation  to  the  frequency  domain.  This  is 
possible  only  if  the  leading  matrix  of  the  linearized  equation  is  time-invariant  and  if  the  forcing 
term  is  Fouiier  transformable.  This  objective  has  been  achieved  in  the  planar  case  with  the  use  of 
reference  coordinates  for  the  rigid  body  variables  to  describe  the  position  and  orientation  of  the 
floating  reference  frame  [6]. 

The  three-dimensional  inverse  kinematics  problem  presents  additional  difficulties  not 
found  in  the  planar  case  [7].  Fust,  unlike  the  planar  case,  the  three-dimensional  torque  vectors 
change  directions  in  time,  so  that  the  external  force  vector  in  Eq.  (1.3)  becomes  a  nonlinear 

function  of  the  rigid  body  orientation  coordinates.  To  overcome  this  difficulty,  a  proper 
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parametrization  of  the  rigid  body  coordinates  and  proper  bases  for  the  joint  torques  are  necessary 
to  attain  the  stated  objectives  in  forming  the  linearized  inverse  kinematics  equations.  The  desired 
form  of  the  linearized  inverse  kinematics  equation  is  possible  if  Euler  parameters  arc  used  to 
describe  the  rigid  body  orientation  and  if  the  base  torque  vector  of  each  multibody  component  is 
expressed  in  terms  of  components  along  the  associated  floating  reference  flame. 

For  a  typical  multibody  component,  say  body  i,  the  equations  of  motion  can  be  written  in 
the  following  partitioned  form  [7]: 
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The  elemenis  of  the  mass  matrix  and  quadratic  velocity  force  vector  corresponding  to  an 
isoparametric,  three-dimensional  curved  beam  finite  element  are  given  in  Reference  7. 

Let  t'  be  the  torque  vector  at  the  base  of  body  »,  whose  three  components  xj.,  x^  and  x{ 
are  parallel  to  the  associated  floating  reference  axes  r*,  s',  and  r* ,  respectively.  If  we  use  Euler 
parameters  as  the  rigid  body  orientation  coordinates,  the  externally  applied  joint  forces 
associated  with  the  rigid  body  rotation  of  body  i  can  be  expressed  as 

Q'q  =  [g‘]^  x'^  j  (1.6) 

where  x'  is  the  base  torque  acting  on  body  i  and  whose  components  are  parallel  to  the  floating 
reference  axes  associated  with  body  /;  x'"'’*  is  the  vector  of  joint  torques  and  reactic.i  moments 
transmitted  from  body  i  to  body  i  +  1,  and  whose  components  are  parallel  to  the  floating 
reference  axes  associated  with  body  j  +  1;  A'  and  are  body  axes  to  inertial  axes  rotation 
transformation  matrices  for  bodies  i  and  i  +  I,  respectively;  and  G‘  is  a  matrix  that  maps  the 
derivatives  of  the  Euler  parameters  describing  the  orientation  of  the  reference  frame  of  body  i  to 
the  angular  velocity  of  this  reference  frame,  and  is  given  by  G‘  =  2  E*.  Combining  Eq.  (1.6) 
with  the  second  set  of  equations  in  Eq.  (1.5),  and  after  and  involved  process  described  in 
Reference  7,  the  inverse  dynamic  equations  are  obtained  as: 
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•t‘  =  [A‘f  A'**  x‘*‘  +  -J  G'  R‘  +  J‘  G'  e*'  +  }‘f  qy  +  -J  G'  4>^i  X. 

4&[Gf{j-G‘e4jy<,y} 

where  J*  is  the  3  X  3  inertia  tensor  of  body  i  with  respect  to  the  origin  of  the  floating  reference 
frame  and  measured  relative  to  this  frame,  and  is  the  inertir  matrix  coupling  the  rigid  body 

rotation  and  the  elastic  deformation.  The  key  to  obtaining  a  time-in\  oriant  leading  matrix,  »hat  is 
necessary  in  transforming  the  linearized  equations  of  motion  into  the  frequency  domain,  is  the 
fact  that  the  inertial  coupling  matrix  Jy-  can  be  decomposed  into  the  sum  of  a  time-invariant 

matrix  and  a  time-varying  matrix,  i.e.. 


where  and  are  the  time-invariant  part  and  time-varying  part  of  Jf,  respectively.  This 
decomposition  is  essential  m  the  formulation  of  the  inverse  kinematics  equations  that  lead  to  non- 
causal  solutions  of  the  nonlinear  inversion  problem  and  which  are  given  by 

mff  q/  +  qff  q‘/  =  F‘  (X,  qj.,  q*.,  q{.,  q^,  q^,  q^)  (1.9) 

where 

=  4  (1.10) 
and  ihe  motion-induced  force  vector  acting  on  the  elastic  degrees  of  freedom  is  given  by 
F*  =  nJ  |[a‘]^  [a*]^  A'""^ 

^  (1.11) 

The  modified  mass  matrix  m  j  i?  non  symmetric  and  it  is  precisely  this  non  symmetry 
that  produces  elastic  displacements  which  are  non-causal  with  respect  to  the  end-point  motion 
when  non-causal  techniques  are  employed  to  obtain  the  proper  inversion  of  the  nonlinear,  non¬ 
minimum  phase  systems. 
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The  non-causal  inversion  can  now  be  carried  out  efficiently  in  the  frequency  domain  since 
the  leading  matrices  have  been  constructed  such  that  they  remain  constant  throughout  the  motion. 
Equation  ( 1.9)  can  be  written  as  a  set  of  complex  equations  for  a  particular  frequency  O) 


.  10)  (1)'^  " 


qy(o))  =  F^((o) 


(1.12) 


where  qy-Cco)  is  the  Fourier  transform  of  q^ft)  and  F'fco)  is  the  Fourier  transform  of  F'(t). 

Alternatively,  the  computation  of  the  elastic  displacements  and  their  derivatives  in  each 
iteration  can  also  be  carried  out  in  the  time  domain  through  the  use  of  the  non-causal  impulse 
response  function  and  the  bilateral  Laplace  transfOTtn,  e.g.. 


q/(t)=  7  I  h/t-t)//x)ut 

-ooj=l  tl.lj) 

where  hj(t)  is  the  non-causal  acceleration  response  vector  to  an  impulse  applied  to  the  degree 
of  freedom  and  f  y(t)  is  the  jth  component  of  the  forcing  term  on  the  right  hand  side  of  Eq.  (1.9). 
We  note  that  the  integration  from  -«o  to  » is  neojssary  to  capture  the  non-causal  effects. 

Once  the  non-causal  elastic  displacements  and  their  derivatives  arc  known,  Eq.  (1.7)  can 
be  used  to  explicitly  compute  the  non-causal  inverse  dynamics  joint  efforts  that  will  move  the  end 
effector  according  to  a  desired  trajectory.  We  note,  however,  that  the  joint  torques  and  elastic 
displacements  given  by  Eqs.  (1.7)  and  (1.9),  respectively,  depend  on  the  Lagrange  multipliers 
and  rigid  body  coordinates,  which  in  turn  depend  on  the  elastic  displacements  and  the  applied 
torque.  Moreover,  the  rigid  body  coordinates  and  Lagrange  multipliers  are  different  from  their 
nominal  values  when  the  components  of  the  multibody  system  arc  flexible.  Therefore,  a  forward 
dynamic  analysis  is  required  to  obtain  an  improved  estimate  of  the  generalized  coordinates  and 
Lagrange  multipliers. 

Results 

Figure  1.2  shows  a  closed-chain,  three-dimensional  flexible  articulated  structure,  where 
the  selected  control  torques  are  shown  in  the  figure.  Joints  1-4  are  revolute  joints  while  joint  5  is 
a  spherical  joint  The  desired  end-point  (joint  5)  trajectory  is  a  motion  in  the  X2  -  X3  plane  with 
the  X2  coordinate  and  X3  coordinate  of  the  end-point  following  the  trajectories  shown  in  Fig. 
1 .3.  The  four  links  share  the  following  geometric  and  material  properties: 


Fig.  12.  Close<hmn  3-D  Flexible  articulated  structure. 


Joint  Torque  T3  (N-m) 


Fig.  1.4.  Inverse  dynamics  torques. 


Link  ^2  transverse  def.  along  t  (rn 


-0.0  0.5  1.0  1.5 

Time  (sec.) 


internal  point. 


Fig.  1 J.  Elastic  direction  in  an 
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Length:  1.0m;  Cross  section  dimensions:  1.0  cm  x  1.0  cm;  Young's  modulus:  40  GPa; 
Shear  modulus:  15  GPa;  Mass  density:  2715  kg/m3;  Tip  mass:  0.1  kg. 

The  procedure  is  applied  to  the  closed-chain  system  by  introducing  a  cut  at  the  end-point 
(joint  5),  thus  creating  two  open-chain  systems.  The  internal  constraint  forces  exposed  by  the 
cut  are  automatically  taken  into  account  by  the  Lagrange  multipliers  in  the  equations  of  motion. 
Figures  1.4a  and  1.4b  show  joint  torques  T2  and  T3,  respectively,  that  are  needed  to  achieve  the 

desired  end-point  trajectory.  In  these  figures,  the  dashed  curves  refer  to  the  inverse  dynamic 
torques  obtained  by  the  present  procedure  while  the  solid  curves  refer  to  the  corresponding  rigid 
body  torques.  Observe  the  pre  and  post  activation  required  by  the  inverse  dynamics  situation. 
Figure  1.5  shows  the  transverse  deflections  at  a  third  point  in  link  #2,  obtained  from  a  feed¬ 
forward  of  the  inverse  dynamic  torques  (dashed  curve)  and  the  corresponding  deflection  obtained 
from  a  feed-forward  of  the  rigid  body  torque  (solid  curve).  It  may  be  observed  that  the  inverse 
dynamic  torques  minimize  the  residual  structural  vibrations  that  are  otherwise  present  when  rigid 
body  torques  are  used  to  actuate  the  system. 

Conclusion 

A  new  and  general  procedure  for  determining  the  inverse  dynamics  and  kinematics  of  2-D 
and  3-D  flexible  articulated  stmetures  and  multibody  systems  has  been  developed.  An  iterative 
procedure  is  necessary  because  of  the  interdependence  between  the  elastic  coordinates,  the  rigid 
body  coordinates  and  the  associated  Lagrange  multipliers  in  the  system  equations  of  naotion.  The 
procedure  is  general  since  it  is  valid  for  both  open-chain  and  closed-chain  configurations,  and 
differs  from  the  previously  proposed  recursive  methods  in  the  sense  that  the  rigid  body 
coordinates  are  not  assumed  to  follow  the  nominal  motion.  The  conditions  for  trajectory  tracking 
are  now  met  in  a  more  general  way  through  the  satisfaction  of  rheonomic  constraint  conditions. 
For  closed-chain  systems,  the  new  method  is  the  only  valid  procedure  for  determining  the 
inverse  dynamic  torques  since  in  this  case,  the  number  of  control  torques  is  smaller  than  the 
number  of  joints  and  therefore,  the  recursive  methods  can  not  be  applied. 
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2.  Inverse  Dynamics  of  Nonlinear  Articulated  Structures: 

Simultaneous  Trajectory  Tracking  and  Vibration  Reduction. 

Summary 

The  problem  of  inverse  dynamics  for  flexible  multibodies,  which  arises,  in  trajectory 
tracking  control  of  flexible  multibodies  such  as  space  manipulators  and  articulated  flexible 
structures  is  studied.  Previous  research  has  resolved  this  trajectory  tracking  problem  by 
computing  the  system  inputs  for  feed-forward  control  of  actuators  at  the  joints.  Recently,  the  use 
of  distributed  actuators  like  electro-strictive  actuators  in  flexible  structures  has  introduced  a  new 
dimension  to  this  trajectory  tracking  problem.  In  this  paper  we  optimally  utilize  such  actuators  to 
aid  joint  actuators  for  tracking  control,  and  introduce  a  new  inverse  dynamics  scheme  for 
simultaneously  (1)  tracking  a  prescribed  trajectory  and  (2)  minimizing  ensuing  elastic  deflections. 
We  apply  this  scheme  for  trajectory  tracking  of  a  two-link  two-joint  planar  manipulator  with  joint 
motors  and  distributed  electro-strictive  actuators.  Experimental  results  are  presented  to  contrast 
our  new  scheme  with  other  existing  methods.  The  study  summarizes  the  work  done  under  the 
contract  and  described  in  references  14  and  15. 

Introduction 

Inverse  dynanucs  provides  an  excellent  means  for  trajectory  tracking  of  flexible 
multibodies.  Methods  to  pie  c<^mpute  the  actuator  inputs  required  to  exactly  track  a  given  output 
trajectory  of  a  control  point  on  open-chain  flexible  multibodies  have  been  developed  [1-5]  where 
the  inverse  dynamics  and  kinematics  produce  bounded  feed-forward  inputs  for  actuators,  like 
motors  at  articulations  and  joint  angles,  to  track  a  reference  point  on  the  structure.  The  closed- 
chain  cases  have  been  recently  presented  by  Ledesma  and  Bayo  [6-7].  If  the  sensors  and 
actuators  are  non-coUocated  then  the  flexible  structure  has  nonminimum  phase  dynamics  and  the 
only  stable  inverse  dynamics  solution  to  the  tracking  problem  is  non-causal  [2].  Once  a 
trajectory  is  specifled,  the  feed-forward  control  input  obtained  by  inverse  dynamics  for  exact 
trajectory  tracking,  has  a  uiuque  bounded  solution.  Therefore,  the  subsequent  elastic  structural 
vibrations  induced  on  the  structure  (except  at  the  control  point  where  these  vibrations  are  zero) 
during  the  trajectory  tracking  motion  are  also  defined  uniquely.  These  vibrations  could  be 
detrimental  to  the  performance  of  sensitive  on-board  systems  and  hence  it  is  desirable  to 
minimize  them.  For  some  time,  distributed  actuators  have  been  successfully  used  to  control 
structural  vibrations  [10-11].  Recent  success  in  their  experimental  use  [12]  motivates  the  use  of 
such  actuators  to  aid  joint  actuators,  like  motors,  for  trajectory  tracking. 


The  trajectory  tracking  objective  can  be  accomplished  by  the  point  actuatOTS  alone  [13]  and 
in  this  sense  the  distributed  actuators  are  redundant  In  this  study  (which  summarizes  the 
contribution  of  references)  the  concept  of  using  the  extra  actuation  available  through  the 
distributed  actuators  in  the  structure  is  introduced  to  not  only  satisfy  the  trajectory  tracking 
constraint,  but  also  minimize  the  accompanying  elastic  displacements  during  the  motion.  A  new 
inverse  dynamics  method  is  presented  to  compute  the  feed-forward  inputs  which  includes  the 
cases  of  redundantly  actuated  structures.  This  use  of  distributed  actuators  for  end  effector 
trajectory  control  is  contrasted  with  the  use  of  only  the  joint  actuators  in  feed-forward.  The 
method  proposed  is  shown  to  substantially  reduce  the  induced  vibrations  in  the  structure.  The 
results  are  experimentally  verified  using  a  flexible  two  link  articulated  truss  structure  with 
distributed  electro-strictive  actuators  and  joint  motors. 

Formulation 

The  inverse  dynamics  of  a  flexible  multibody  is  a  nonlinear  problem.  We  solve  this  non¬ 
linear  problem  recursively,  one  element  of  the  multibody  at  a  time.  This  algorithm  proposed  by 
Bayo  et  al  in  [3],  for  general  multibody  inverse  dynamics  involves,  1)  studying  an  individual 
component  (link)  in  the  chain;  2)  coupling  the  equations  of  the  individual  links;  and  3) 
recursively  converging  to  the  desired  actuator  inputs  and  corresponding  displacements. 
Following  this  general  procedure,  a  new  scheme  is  proposed  which  incorporates  distributed 
actuators  in  the  solution  of  the  inverse  dynamics  problem  [14-15].  This  approach  leads  to  the 
following  inverse  dynamics  equations  in  terms  of  joint  torque  and  piezoelectric  voltages: 

Mz  +  [C + Cc(O)/,)]  z + [K + Kc(a/„co;,)]  z  =  Bj-T  -i-  + F  (2.1) 

where  z  is  an  R'*  vector  of  the  finite  element  degree  of  freedom.  M  and  K  belong  to  and 
are  the  conventional  finite  clement  mass  and  stiffness  matrices  respectively;  and  e  R"^" 
and  are  the  time  varying  (Zoriolis  and  centrifugal  stiffness  matrices,  respectively.  The  R"^" 
matrix  C  represents  the  internal  viscous  damping  of  the  material.  T  is  the  unknown  joint 
actuation  F  €  R'*  contains  the  reactions  at  the  end  of  the  link,  and  the  known  forces  produced  by 
the  rotating  frame  effect  The  distributed  actuator  inputs  €  R"^  are  the  equivalent  nodal 
forces  at  the  FEM  degrees  of  freedom,  where  np  is  the  number  of  distributed  actuator  inputs. 
Bj  and  Bp  arc  constant  matrices  inputs  influence  matrices  of  dimensions  R"  and  R"^'^, 
respectively. 

The  requirement  is  to  accurately  track  the  end  effector  of  the  link  along  the  given  nominal 
trajectory  without  overshoot  and  residual  vibrations.  If  the  distributed  actuators  were  not 
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available  then  the  exact  tip  trajectory  tracking  requirement  defines  the  joint  input  torque  T.  The 
objective  is  to  use  the  additional  actuation  available  through  the  distributed  actuators  to  reduce  the 
ensuing  structural  vibrations  at  locations  away  from  the  control  point  during  this  motion  by 
minimizing  J(T,  Vp),  a  measure  of  elastic  deflections  in  the  structure  defined  as  follows: 


J(T.Vp)A7z(t/2(t)dt  (2.2) 

— oo 

Mathematically  the  objective  can  be  stated  as 

min  _J(T.Vp)  (2.3) 

(T.Vp)eT  ^ 

Where  T  is  the  set  of  all  pairs  of  stable  actuator  inputs  that  when  used  to  actuate  the  system 
defined  by  Eq.  (2. 1)  yields  Zf(t)  =  0  for  all  t. 

The  solution  process  starts  by  rewriting  Eq.  (2.2)  as 


Mz  +  Cz+  Kz  =  Bj'T  +  Bp  Vp  + F  — Cj.(o);i)  z  —  !C^(ot^,0}/|)  z  (2.4) 

where  the  time  dependent  Coriolis  and  centrifugal  terms  are  kept  on  the  RHS  of  the  equation. 
The  Iteration  procedure  starts  with  the  absence  of  the  last  two  terms  involving  C^.  and  in  the 
right  hand  side.  Then,  the  system  of  equations  can  be  transformed  into  independent  sets  of 
simultaneous  complex  equations  by  means  of  the  Fourier  transform.  For  each  of  the  evaluation 
frequency  o),  Eq.  (2.5)  becomes 


m+t^c--4k 


ICO 


C0‘ 


h 

t 

b 

= 

0 

+ 

f;- 

+ 

0 

Ft 

[bJ 

(2.5) 


A 

where  the  symbol  a  stands  for  Fourier  transform,  and  F  represents  the  known  forcing  terms. 
After  the  first  iteration  it  will  also  include  the  updated  contributions  from  the  Coriolis  and 
centrifugal  terms  appearing  in  the  RHS  of  Eq.  (2.4).  For  any  co  ^  0,  the  matrix 


M  +  7^C- 
1(0 


(2.6) 
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is  a  complex,  symmetric  and  invertible  matrix.  For  co  =  0  the  system  undergoes  a  rigid  body 
motion  and  H  ^  M,  the  positive  definite  invertible  mass  matrix.  Let  G  -  Then  a 

relationship  between  the  joint  actuation  and  the  distributed  actuator  inputs  and  is  obtained  [14-15] 
as: 

t  =  -G7*‘[G,*G^G„]  (F  +  B,  Vp)  (2.7) 


Substituting  this  expression  for  the  input  hub  torque  in  Eq.  (2.3)  and  using  the  property  that 
z  =  -0)^z  yields 


z  =  -i(AV„  +  B) 

(2.8) 

where 

A^  [=Gr;GB7.(G,;^GrtG„)  +  G]B^ 

(29) 

and 

B^  [=Gr/,'GB7<G,AG„G„)  +  G]F 

(2.10) 

Next  using  Parseval's  theorem  the  piezo-electric  voltages  are  obtained  as  [14-15]: 


Vp  =  -(A*A)"^A*B  (2.11) 

where  the  symbol  *  denotes  conjugate  transpose.  A  sufficient  and  necessary  condition  for  A  to 
have  rank  np  is  given  in  references  [14-15]. 

The  distribution  of  actuation  effort  depends  on  the  mode-shapes  of  the  structure  and  the 
actuator  placement,  and  can  be  modeled  easily  using  FEM  [16].  In  the  problem  at  hand,  the  best 
placement  for  a  given  trajectory  depends  not  only  on  the  structure  but  also  the  component 
frequencies  of  the  desired  motion.  Thus  the  optimal  placement  of  the  actuator  would  in  general 
be  trajectory  dependent  In  most  structures,  such  a  freedom  of  changing  the  actuator  placement 
with  the  prescribed  trajectory  is  not  available.  The  design  of  such  structures  with  fixed  actuator 
placements,  is  based  on  minimizing  the  induced  structural  vibrations  over  sets  of  disturbances 
with  a  specified  energy  [16].  In  systems  where  the  required  motions  are  largely  repetitive,  the 
actuator  placement  can  be  optimized  over  a  specified  set  of  trajectories;  this  warrants  a  separate 
treatment  to  be  considered  in  a  future  work. 


Fig.  2.1.  Experimental  set  up. 


14 


Experimental  Verification 

An  experimental  truss  structure  developed  at  UCSB  is  shown  in  Figure  2.1.  The 
structure  has  16  spans  and  two  articulations  forming  a  planar  manipulator.  The  trusses  are  made 
of  aluminum  and  have  lumped  masses  (net  2  Kgs  for  each  link)  distributed  along  their  lengths  in 
order  to  lower  the  links  modal  frequencies  and  hence  the  control  sample  rate.  In  addition,  the 
first  (.base)  and  the  second  links  have  tip  loads  of  3.5  and  1  Kgs  respectively.  These  loads 
further  increase  the  flexibility  of  the  structure  and  the  natural  frequencies  of  the  first  and  second 
links  with  clamped  free  boundary  conditions  are  0.6  Hz  and  1.2  Hz  respectively.  Actuation 
consists  of  low-inertia  dc-motors  at  the  two  joints  and  an  active  bay  (Figure  2.2)  with  four 
electro-strictive  actuators.  Sensing  consists  of  resolvers  at  the  joints  and  collocated  strain 
sensors  on  the  four  electro-strictive  actuators.  In  addition,  an  optical  sensor  measures  the 
position  of  an  infra-red  LED  mounted  about  the  midpoint  of  the  first  link,  thus  providing 
information  of  the  induced  structural  vibrations  during  the  tracking  operation.  The  entire 
structure  is  supported  on  air  bearings  and  controlled  with  an  Intel  386-based  PC,  servo 
amplifiers  for  the  motors  and  150V  servo  amplifiers  for  the  electro-strictive  actuators. 

To  evaluate  the  proposed  use  of  distributed  actuators,  we  apply  it  to  track  the  end-effector 
of  the  two  link  flexible  articulated  structure.  The  desired  trajectory  of  the  end-effector  is  a  series 
of  rest  to  rest  tixitions,  while  the  first  link  is  stationary.  The  nominal  motion  of  the  second  link  is 
shown  in  Figure  2.3.  The  objective  is  to  track  the  desired  trajectory  and  minimize  the  vibrations 
in  the  first  link  which  is  equipped  with  electro-strictive  actuators.  To  evaluate  the  vibration 
reduction  achieved,  the  following  tracking  experiments  arc  conducted:  (1)  feed-forward  of 
torques  computed  without  invene  dynamics,  i.e.  assuming  the  links  to  be  rigid;  (2)  using  the 
torques  computed  by  inverse  dynamics  for  only  the  joint  actuators;  and  (3)  incorporating  the 
distributed  electro-strictive  actuators  on  the  first  truss  along  with  joint  actuators  in  the  inverse 
dynamics  computation  and  using  these  as  feed-forward.  In  each  case  a  joint  based  PD  controller 
was  used  for  controlling  cnors  due  to  unmodeled  dynamics,  friction  and  other  modeling  errors. 

Plots  of  the  inputs  to  he  electro-strictor  and  joint  motors  arc  presented  in  Figures  2.4  and 
2.5.  Note  that  the  actuations  start  before  the  tip  trajectory  begins.  This  non-causality  due  to  the 
propagation  delays  is  reduced  when  additional  actuation  is  available  through  the  piezos  as  seen  in 
Figure  2.5.  To  illustrate  the  viability  of  the  proposed  method  we  plot  the  transverse  structural 
deflections  at  the  midpoint  of  r!;e  first  link  (Figure  2.6)  during  the  motion  obtained  by  an  infra¬ 
red  led  mounted  on  the  structure  and  an  over-head  optical  sensor.  These  elastic  deflections  in  the 
structure  are  considerably  reduced  when  electro-strictive  actuators  are  also  used  in  addition  to  the 
joint  motors.  On  the  contrary  if  inverse  dynamics  is  not  used  and  the  rigid  body  torques  are  used 
then  the  resulting  motion  has  much  larger  vibrations. 
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Fig.  2.4.  Command  Strain  input  to  the  electro-strictor. 
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Thus  the  incorporation  of  electro-strictive  actuators  results  in  a  significant  reduction  in  the 
structural  vibrations  and  demonstrates  the  viability  of  the  proposed  method  The  consequent 
reduction  (50%)  in  the  induced  vibrations  of  the  structure  allows  the  use  of  lighter  elements  and 
therefore  smaller  joint  actuators,  especially  in  space  structures  where  the  loads  are  mainly  inertial. 

Conclusion 

Typically  distributed  actuators  like  the  electro-strictive  ones  cannot  gamer  enough 
actuation  to  cause  large  motions  in  the  multibody  system.  However  they  could  be  very 
effective  in  reducing  structural  deformation.  To  reduce  such  vibrations  by  the  use  of 
distributed  actuators  in  feedforward  aiding  joint  actuators  for  trajectory  tracking  is  a  novel 
idea  developed  in  this  study.  The  method  proposed  is  extremely  efficient  as  it  optimally 
reduces  structural  vibrations  and  the  theory  developed  was  verified  by  experiments.  The 
use  of  the  redundant  distributed  actuators  seems  promising  in  the  slewing  control  of 
flexible  manipulators  and  other  space  structures,  and  motivates  further  work  on  distributed 
actuators  for  the  control  of  flexible  multibodies. 


3.  Buckling  Control  of  a  Flexible  Beam  Using  Piezoelectric 
Actuators 

Summary 

A  new  application  of  piezoelectric  actuation  is  described  for  enhancing  the  load 
capacity  of  a  beam  under  compression.  By  feedback  control,  the  first  buckling  mode  is 
stabilized  and  the  buckling  load  is  dramadcally  increased  to  the  critical  load  of  the  second 
buckling  mode  of  the  beam.  The  approach  uses  a  truncated  modal  model  of  the  beam, 
distributed  piezo  actuators,  and  strain  gauges  for  feedback. 

Introduction 

Active  damping  and  control  of  flexible  structures  has  been  an  area  of  research  focus 
for  some  time.  However,  the  recent  application  of  distributed  piezoelectric  actuators  to 
structure  control  by  Crawley  and  De  Luis  [17],  Bailey  and  Hubbard  [18],  and  Rsher  [20] 
has  posed  new  and  challenging  problems.  Following  the  initial  experiments  of  these 
researchers,  where  a  single  vibrational  mode  is  controlled,  Fisher  addressed  the  actuator 
placement  problem  to  control  several  modes. 

We  address  the  new  problem  of  buckling  control  using  smart  materials.  In  contrast 
to  the  dynamic  stability  issues  of  vibration  control,  buckling  is  a  static  instability  of  axially 
loaded  members  of  a  structure.  It  is  well  known  that  as  the  axial  compressive  load  P  in  an 
initially  straight  beam  increases,  the  beam  remains  straight  and  undeformed  until  the  load 
reaches  a  certain  critical  value  Pcr,i>  where  the  stable  equilibrium  of  the  first  bending  mode 
bifurcates  into  one  unstable  and  two  stable  equilibria  (pitchford  bifurcation).  The  two 
stable  equilibria  correspond  to  buckled  configurations. 

Here  we  use  piezoelectric  actuators  and  strain  gauge  sensors  to  show  that  buckling 
of  a  simply  supported  beam  can  be  postponed  beyond  the  first  critical  load.  The  load 
deflection  characteristic  for  large  deflections  of  a  beam  in  a  buckled  configuration  is  highly 
nonlinear  and  involves  numerical  solution  of  elliptic  integrals.  Figure  3.1a  shows  a  typical 
load  deflection  curve  where  Pcr^i  is  the  buckling  load  of  the  nth  mode.  If  P  <  Pcr.i.  the 
undeflected  beam  is  stable.  For  Pcrji  <  P  <  Pcrn+l  ^  modes  are  stable  except  for  the  first 
n  bending  modes.  The  idea  reflected  in  our  work  is  the  use  of  feedback  control  in 
conjunction  with  piezoactuators  to  stabilize  the  first  bending  mode  beyond  Pcr,i  and  achieve 
a  bifurcation  diagram  of  the  form  shown  in  Fig.  3.1b,  where  the  buckling  force  Pcr.i  is 
greater  than  that  for  the  uncontrolled  beam. 

We  use  the  linearized  equations  of  motion  and  the  associated  modal  equations  of  a 
simply  supported  flexible  beam  with  piezoelectric  actuators  subjected  to  slowly  varying 


axial  load.  A  finite-dimensional  state-space  model  is  then  derived  for  a  reduced  order 
system  and  a  controller  is  designed  to  increase  the  stiffness  of  the  first  bending  mode  to 
exceed  that  of  the  second  bending  mode. 


(a)  (b) 


Fig.  3.1.  Load  deflection  curve  of  a)  uncontrolled  beam  and  b)  controlled  beam. 
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Fig.  3.2.  Singly  supported  column  with  piezoelectric  actuators. 

Problem  Formulation 

We  use  a  truncated  modal  expanstion  of  the  deflection  of  a  beam  to  derive  a  linear 
finite-dimensional  model.  We  emphasize  that  the  beam  is  assumed  to  be  uniform  with  no 
manufacturing  imperfections.  Since  the  aim  is  to  stabilize  the  beam  in  its  straight 
configuradon,  it  is  natural  to  assume  small  deflecdons  and  linearize  the  equations  of  motion 
about  this  configuration.  Note  that  the  strain  induced  by  ppezoelectric  actuators  is  usually 
small  and,  therefore,  the  small  deflection  assumption  is  consistent  with  the  capacity  of  the 
actuators. 

Figure  3  2  shows  a  simply  supported  uniform  beam  with  piezoelectric  actuators  of 
equal  thickness  bonded  to  both  sides  by  a  suitable  adhesive.  The  beam  of  width  b  and 
thickness  tb  is  subjected  to  an  axial  compressive  load,  and  control  moments  are  applied  by 
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the  piezoactuators.  The  actuator  being  modeled  is  a  piezoelectric  polymer,  poly  vinylidene 
fluoride.  For  an  axially  polarized  piezo,  a  voltage  applied  across  its  thickness  results  in 
strain  along  its  length.  For  simplicity  the  width  of  each  piezolayer  is  assumed  to  be  the 
same  as  that  of  the  beam. 

The  strain  Aj  developed  in  an  unconstrained  piezo  is  given  by  Aj  =  Vi(t)d3i/  tp 
where  V|(t);  i  =  1,  2  is  the  voltage  applied  to  the  ith  piezostrip,  d3i  the  piezoelectric  strain 
constant,  and  tp  the  thickness  of  the  piezolayer.  If  Vj  and  V2  are  the  voltages  applied  on 
the  top  and  bottom  piezolayers,  respectively,  and  Ep  is  the  Young's  modulus  of  the  piezo, 
the  resulting  moment  on  the  piezobeam  segment  is  given  by 


2  ®  2 


M  =  bEptp(A2-Ai)| 

=  bEpd3  +  ta  +  ^  I  (V2  -  Vj)  A  k*  ( V2  -  Vj) 


(3.1) 


The  equation  of  motion  of  the  beam  can  be  derived  using  Hamilton's  variational  principle. 
Under  small  deflection  assumption  Hamilton’s  principle  yields 

pAy  +  (EIy")  +  (Py')'=M[8’(x-X2)-5'(x-xi)]  (3.2) 

where  p  is  the  density  of  the  beam;  y  is  the  transverse  deflection;  y  and  y’  are  the  time  and 
spatial  derivatives  of  y,  respectively;  El  is  the  stiffness  of  the  beam;  A  is  the  cross-sectional 
area  of  the  beam;  5'  is  the  spatial  derivative  of  the  delta  function;  and  xi  and  X2  are  the 
locations  of  the  two  ends  of  the  piezolayer. 

Modal  states  are  estimated  from  strain  gauge  measurements  at  discrete  locations.  It 
is  easy  to  see  that  observability  of  the  modes  of  the  system  depends  on  the  location  of  the 
sensors;  a  mode  with  its  node  at  the  location  of  a  strain  gauge  is  unobservable  with  that 
sensor.  To  reduce  the  number  of  sensors,  modal  control  of  flexible  structures  is  usually 
based  on  the  first  few  modes  of  vibration.  This  is  justified  by  the  fact  that  higher 
vibrational  modes  are  in  general  difficult  to  excite  and  have  higher  structural  damping. 
However,  the  unmodeled  dynamics  can  cause  instability  through  what  are  known  as 
control  and  observation  spillover.  It  has  been  shown  that  both  control  and  observation 
spillover  of  unmodeled  modes  are  necessary  to  cause  instability  in  a  closed-loop  system. 

The  sensors  are  placed  so  that  the  second  and  third  modes  and  their  multiples  are 
unobservable.  Thus  the  first  and  fifth  modes  are  the  first  two  modes  in  a  minimal 
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realization  of  the  system.  We  ignore  higher  order  modes  and  discuss  the  associated 
spillover  problems  in  [20].  If  a  small  amount  of  structural  damping  is  present,  all  of  the 
unobservable  modes  remain  stable  even  in  the  presence  of  spillover.  Similarly  the 
dynamics  of  the  even  modes  are  not  affeaed  by  the  control,  and  hence  they  remain  stable. 

We  model  the  sensors  as  follows.  The  bending  moment  at  the  location  of  a  strain 
gauge,  a  distance  x  from  the  lift  end  of  the  beam,  is  given  by 

Mb  =  -Eble,y"(x,t)  =  Ebl«,  p  T  >!„  sin  (3.9) 

w  here  Eb  is  the  Young's  modulus  of  the  beam  material  and  leq  is  the  equivalent  moment  of 
inertia  of  the  composite  piezo-beam  segment  based  on  beam  material.  The  resulting  strain 
in  a  strain  gauge  attached  to  one  side  of  the  beam  is 

P  (3.10) 

^eq^b 


where  Agq  is  the  equivalent  area  based  on  beam  material.  The  sign  of  the  first  term  in  Eq. 
(3. 10)  depends  on  which  side  of  the  beam  the  strain  gauge  is  bonded  to.  Therefore,  the 
output  of  a  differential  strain  gauge  is  independent  of  the  load  P  and  is  given  by 


Vj  =  2  e  kg  =  2 


[2,  [(tb/2)  +  ta  +  tpl«^ 

n  I? 


n=oo 

I 


n=l 


X  n  Tin  sm 


.  /^n7tx'\ 

■"l—J 


Aks  I 

n=l 


n^Tln 


“l—J 


(3.11) 


where  kg  is  the  strain  gauge  constant  If  differential  strain  gauges  are  placed  as  x  =  173  and 
x  =  2173,  and  the  sum  of  their  measurements  is  taken  as  the  system  output  we  have 


'o^''s(x  =  y)^-v,(x  =  ^) 

=  k,  i  n^Ti„^sin-Jy-»-sin^j 


(3.12) 
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Results 

The  modal  model  developed  above  was  truncated  to  form  the  actuated  beam  model  and 
sensor  model.  A  standard  linear  quadratic  regulator  was  then  designed.  We  showed  that 
the  buckling  of  a  flexible  beam  can  be  postponed  beyond  the  first  critical  load  by  means  of 
feedback  using  piezoelectric  actuators  and  strain  gauge  sensors  [20].  It  is  observed  that  a 
controller  design  based  on  a  fixed  axial  load  Pmax  stabilizes  the  modeled  modes  for  any  P  ^ 
Pmax  and,  therefore,  is  robust  to  slow  load  variations.  Hence  buckling  in  the  first  mode  is 
inhibited,  and  the  beam  can  support  a  load  up  to  the  second  critical  load.  Actuator  and 
sensor  placement  is  discussed  with  regard  to  problems  of  spillover.  Finally,  spillover  has 
not  posed  serious  problems  as  we  are  able  to  design  the  controller,  in  the  case  of  a  beam, 
using  a  low-order  model  and  verify  stability  for  a  high-order  model 
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4.  Nonlinear  Inversion-Based  Regulation 

Summary 

In  this  study  a  new  inverse  is  introduced  for  nonlinear  systems.  This  inverse 
agrees  with  that  of  Hirschom  for  minimum  phase  nonlinear  systems,  but  is  noncausal 
(rather  than  unstable)  in  the  nonminimum  phase  case.  Further,  a  geometric  connection  is 
made  between  the  unstable  manifold  of  the  system  zero  dynamics  and  the  noncausality  in 
the  inverse.  With  the  inverse  used  for  generating  feedforward,  exact  regulation  along  a 
desired  trajectory  is  easily  accomplished  with  the  addition  of  stabilizing  feedback;  this  is 
demonstrated  Avith  a  numerical  example  and  compared  to  the  Bymes-Isidori  regulator. 
Rather  than  solving  a  PDE  to  construct  a  regulate,  the  inverse  is  easily  constructed  using  a 
Picard-like  iteration.  Moreover,  when  preactuation  is  not  possible,  noncausal  inverse 
trajectories  can  be  truncated.  The  result  is  the  introduction  of  transients  common  in  other 
regulators. 

Introduction 

Tracking  control  and  regulation  are  conunon  problems  in  applications  and  have  thus 
attracted  considerable  attention  from  control  researchers.  Asymptotic  tracking  has  been 
solved  for  arbitrary  reference  trajectories  in  the  context  of  linear-quadratic  optimal  control 
[21].  Also  for  linear  systems,  the  asymptotic  rogulation  and  tracking  of  signals  generated 
by  finite  dimensional  linear  systems  has  been  studied  in  a  general  framework  by  Francis 
and  Wonham  [22].  These  authors  show  that  the  tracking  problem  is  solvable  if  and  only  if 
a  set  of  linear  matrix  equations  is  solvable.  In  the  nonlinear  case,  the  Francis-Wonham 
equations  have  been  generalized  to  a  first-oider  partial  differential  algebraic  equation  (PDE) 
by  Byrnes  and  Isidori  [23].  This  fundamental  work  has  been  augmented  with  tests  for 
approximate  solvability  of  the  Bymes-Isidori  PDE  [24]  and  methods  for  optimal  regulator 
design  [25].  In  addition,  extensions  to  the  Bymes-Isidori  regulator  have  been  described 
in  [26],  and  [27].  In  this  paper  we  introduce  a  new  inversion-based  approach  to  exact 
nonlinear  output  tracking  control.  The  basic  idea  presented  is  to  use  feedforward  of  a 
nominal  input  which  forces  a  given  system  along  a  desired  trajectory,  and  then  to  stabilize 
the  trajectory  using  feedback.  We  use  preactuation  to  establish  initial  conditions  in  the 
nonminimum  phase  case,  in  contrast  to  setting  initial  conditions  as  is  done  in  [28].  Our 
approach  eliminates  the  requirement  for  solving  the  partial  differential  algebraic  equation  (of 
potentially  high  dimension)  encountered  in  the  Bymes-Isidori  regulator  by  tracking  a 


specific  trajectory  rather  than  any  one  of  a  family.  Moreover,  no  exosystem  is  required, 
and  the  specification  of  trajectories  is  simplified.  We  do,  however,  introduce  boundedness 
and  integrability  requirements  on  the  trajectory.  The  key  to  our  approach  is  finding  a 
bounded  inverse,  even  for  nonminimum  phase  nonlinear  systems,  for  use  in  generating 
feedforward  inputs.  In  contrast  to  the  inversion  approach  of  Hirschom  [29]  where 
unstable  zero  dynamics  lead  to  unbounded  responses  of  the  inverse  system,  we  introduce  a 
nonlinear  operator  which  is  noncausal  in  n.c  nonminimum  phase  case.  Noncausal 
feedforward  can  be  used  in  the  case  where  trajectory  preview  is  possible,  or  truncated  to  a 
causal  signal  at  the  cost  of  introducing  nansient  tracking  errors.  Such  noncausal  character  is 
seen  in  the  linear  quadratic  setting,  but  the  use  of  exact  inverses  in  both  linear  and  nonlinear 
tracking  control  is  new.  The  noncausal  inverses  used  here  are  motivated  by  the  work  of 
Bayo  [30]  in  flexible  multibodies  and  have  been  applied  to  the  control  of  flexible-link 
robots  in  [31]. 

Problem  Formulation 

Consider  the  nonlinear  system 


X  =  f(x)  +  g(x)u 

(4.1) 

y  =  h(x). 

(4.2) 

defined  on  a  neighborhood  X  of  the  origin  of  51”,  with  input  u  e  and  output  y  e  91** . 
The  functions  fix),  gi(x)  (the  ith  column  of  g(x))  i  -  1,  2, . . .,  p  are  smooth  vector  fields 
and  hi(x)  for  i  =  1,  2, . . .,  q  are  smooth  functions  on  X,  with  fiO)  =  0  and  h(0)  =  0. 

In  the  context  of  the  above  system  pose  the  following 
Stable  Inversion  Problem:  Given  a  smooth  reference  output  trajectory  y(j(t)  e  Lj  n  L,„, 

(bounded,  integrable  signals)  find  a  control  input  u<i(t)  and  a  state  trajectory  x<i(t)  such  that 

1 )  ud  and  xd  satisfy  the  differential  equation 

Xd(t)  =  f  (xd(t))  +  g(x(i(t))  Ud(t),  (4.3) 

2)  exact  output  tracking  is  achieved; 


h(xd(0)  =  yd(t). 


(4.4) 


3)  ud  and  Xd  are  bounded,  and 
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Ud(t)  0,  Xd(t)  ->  0  as  t  ->  ±00.  (4.5) 

We  call  Xd  the  desired  state  trajectory  and  ud  the  nominal  control  input  These  can  be 
incorporated  into  a  regulator  by  using  the  nominal  control  input  as  a  feed-forward  signal 
and  X  -  Xd  as  an  error  signal  for  feedback. 

Results 

In  solving  for  the  nominal  trajectories  xd  and  Ud  the  concepts  of  stable  and  unstable 
manifolds  of  an  equilibrium  point  arise  naturally  [32].  For  the  sake  of  completeness  we 
review  the  definitions  here.  Let  z  =  0  be  an  equilibrium  point  of  an  autonomous  system 
defined  in  a  open  neighborhood  U  of  the  origin  of  91”: 

z  =  f(z),  (4.6) 

and  <])t(z)  be  the  flow  passing  through  z  at  t  =  0.  We  define  the  (local)  stable  and  unstable 
manifolds  W*,  W**  as  follows: 

W*  =  {z  €  UI(J)t(z)  e  UVt  ^  0,  (l>t(z)  0  as  t  00}  (4.7) 

W“  =  {z  €  U  I  ({)t(z)  €  UVt  ^  0, 0t(z)  0  as  t  -+  -«»}  (4.8) 

The  equilibrium  point  z  =  0  is  said  to  be  hyperbolic  if  the  Jacobian  matrix  Df  of  f  at 
z  =  0  has  no  eigenvalues  on  the  jeo  axis.  Let  n^  denote  the  number  of  eigenvalues  of  Df  in 
the  open  left  half  complex  plane,  and  n^  the  number  in  the  open  right  half  plane.  Stable  and 
unstable  manifolds  W*  and  W'*  exist  locally  in  the  neighborhood  of  hyperbolic  fixed  point 

and  have  dimensions  n^  and  n**  respectively. 

For  convetuence.  we  will  use  the  following  notation.  LetH^  {0,  1,  2,  .  .  .},  r  = 

{ri,  r2,  .  .  rq)t  €  tL™  and  y  =  [yi(t),  y2(t),. . yq(t)]\  t  e  91.  Then  we  define 
Ir  1 A  q  T2+.  .  .+rq  and  write 
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yi 

dt^i 


dt^2 

dt'** 


We  will  use  the  bold  number  1  to  denote  the  vector  (1, 1, . . .,  1)‘  so  that 


^  dy2 

dt  ’  dt  ’ 


If  y:  51"  -4  91**  and  f:  91"  — >  91",  we  define 


(4.9a) 


(4.9b) 


(4.10) 


Partial  linearization  and  inversion 

The  system  dynamics  (4.1, 4.2)  is  written  in  the  following  form,  where  the  number 
of  inputs  (p)  is  assumed  to  be  the  same  as  the  number  of  ou^uts, 

x  =  f(x)+ £  gi(x)Ui  (4.11a) 

i=l 


Yl  =  hi(x) 


(4.11b) 


Ym  =  hp(x) . 


We  assume  that  the  system  has  well-defined  relative  degree  r  =  (rj,  12,-..,  Tp)  at  the 
equilibrium  point  0,  that  is, 

(i)  for  ull  1  ^  j  ^  p,  for  all  1  ^  i  ^  p,  for  all  k  <  Tj  - 1,  and  for  all  x  in  a  neighborhood 
of  the  origin. 


Lg.  Lf  hi(x)  =  0 , 


(4.12) 


(ii)  the  p  X  p  matrix 


Lgj  L?-'  hi(x) 


P(x)  = 


Lgj  L?  ‘  h2(x) 


Lg^L?-'hi(*) 

Lj^L?-'h2(x) 


Lg,Lf‘ hp(x)  •-  LjpLf‘hp(x) 


(4.13) 


is  nonsingular  in  a  neighborhood  of  the  origin. 

Under  this  assumption,  the  system  can  be  partially  linearized.  To  do  this,  we 
differentiate  y,  until  at  least  one  uj  appears  explicitly.  This  will  happen  at  exactly  the  rjth 
derivative  of  yi  due  to  (4.12).  Define  =  yp”^^  for  i  =  1, . . .  p  and  k  =  1, . .  .,  ri,  and 
denote 


=  (yi.  j\. yz. • 


(4.14) 


Choose  T),  an  n-lrl  dimensional  function  on  91"  such  that  =  \|r(x)  forms  a 

change  of  coordinates  with  V|/(0)  =  0  [32].  In  this  new  coordinate  system,  the  system 
dynamics  of  equation  (4.1)  becomes 
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fori  =  l,  •••,  p 

4l-  =  “i(5.n)+|3i(in)u 

V  * 

n  =  qi(4.n)+q2(^.  n)u. 

which,  in  a  more  compact  form,  is  equivalent  to 

y('>  =  a{^,  q)  +  P(§,  q)u, 
fl  =  Si(4.Tl)+S2(^.  n)u. 

where 

y  =  (>i.  y2.  yp/ » 

u  =  (ui,  U2,*”,  Up)*, 

a(^,  q)  =  L"fh(v~Vi  n)). 

P(^.ti)  =  L*gL'f'h(v-i(5,q)). 

Here  P  is  acmally  the  same  P(x)  matrix  defined  in  the  equation  (4.13),  a(0,  0) 
f(0)  =  0,  and 

h(x)  =  [hi(x),  h2(x),  hp(x)]‘ , 

g(x)=[gi(x),g2(x),”-,gp(x)]. 


(4.15a) 

(4.15b) 

(4.16) 

(4.17) 


(4.18) 

(4.19) 

=  0  since 
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Since  by  the  relative  degree  assumption,  P(^,  T])  is  nonsingular,  the  following  feedback 
control  law 


uA[P(^,n)]  ^[v-a(^,Ti)]  (4.20) 

is  well  defmed  and  partially  linearizes  the  system  such  that  the  input-output  relationship  is 
given  by  a  chain  of  integrators: 

y^^>  =  V  (4.21) 

where  v  €  31^  is  the  new  control  input  Assume  both  y  and  yd  start  from  rest  and  choose 

v  =  yi'^  (4.22) 


Then  immediately  we  have 

^  A  (ydi.  ydb  yd?’^^*  yd2.  yd?'^^«  y^p (^-23) 

and  equation  (16)  becomes,  which  we  call  the  reference  dynamics,  or  the  zero  dynamics 
driven  by  the  reference  output  trajectory. 


(4.24) 


where 


s(yd.  4d.  "n)  A  si(^d.  ■n)+s2(^d.  n)  [P(^d.  n)]  *  [yj'^  -a(^d*  ‘n)]-  (4.25) 


under  certain  technical  conditions  on  s  (see  [33])  there  exists  a  solution 
Ti(j(  )  €  LinLo,nC®,  to  (51).  Once  the  solution  to  the  zero  dynamics  is  found,  an 
integration  of  the  reference  dynamics  gives  rise  to  a  trajectory  of  the  original  state  through 


the  inverse  coordinate  transformation  Xd  =  X|/ 


'^1 

'.’Id, 


and  an  input  trajectory,  Ud,  by 


equation  (4.20). 
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If  (u  Yd)  has  a  compact  support,  [to,  tiJ,  then  it  is  possible  to  give  a  geometric 

interpretation  of  the  evolution  of  xd(t)  [34],  The  noncausal  part  of  the  nominal  control 
drives  the  internal  states  of  the  system  along  the  unstable  manifold  of  the  zero  dynamics 
manifold  to  a  particular  initiai  condition  Xd(to)  while  maintaining  zero  system  output  This 
initial  condition  guarantees  two  things:  1)  the  desired  reference  output  trajectory  is  easily 
reproduced  with  bounded  input  and  states;  2)  the  internal  states  land  on  the  stable  manifold 
of  the  zero  dynamics  manifold  at  the  end  of  output  tracking.  With  this  nice  final  condition, 
the  internal  states  will  converge  to  zero  along  the  stable  manifold  without  affecting  the 
output. 

Conclusions 

We  have  introduced  a  new  nonlinear  operator  whose  application  in  nonlinear 
inversion  yields  a  clear  connection  between  unstable  zero  dynamics  and  noncausal 
inversion.  When  noncausal  inversion  is  incorporated  into  tracking  regulators,  we  can  see 
that  it  is  a  powerful  tool  for  control  -  particularly  when  computation  is  considered.  An 
imponant  fact  is  that  a  given  system  model  defines  different  input-output  operators 
depending  on  how  boundary  conditions  are  applied.  For  the  study  of  feedforward  control, 
boundary  conditions  at  infinity  give  a  useful  perspective  on  a  system.  We  have  considered 
only  the  case  of  hyperbolic  zero  dynamics.  Cases  where  zero  dynamics  have  a  center 
manifold  or  a  hyperbolic  orbit  should  prove  interesting  as  well. 
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Abstract.  Alter  a  general  review  of  the  methods  currently  available  for  the  dynamics  of  constrained  multibody 
systems  in  the  context  of  numerical  efficiency  and  ability  to  solve  the  differential  equations  of  motion  in  singular 
positions,  we  examine  the  acceleration  based  augmented  Lagrangian  formulations,  and  propose  a  new  one  for 
holonomic  and  non-holonomic  systems  that  is  based  on  the  canonical  equations  of  Hamilton.  This  new  one  proves 
to  be  more  stable  and  accurate  that  the  acceleration  based  counterpart  under  repetitive  singular  positions.  The 
proposed  algorithms  are  numerically  efficient,  can  use  standard  conditionally  staUe  numerical  integrators  aitd  do 
not  fail  in  singular  positions,  as  the  classical  formulations  do.  The  reason  for  the  numerical  efficieiKy  and  better 
behavior  under  singularities  relies  on  the  fact  that  the  leading  matrix  of  the  resultant  system  of  OD^  is  sparse, 
symmetric,  positive  definite,  and  its  rank  is  independent  of  that  of  the  Jacobian  of  the  constraint  equations.  The 
latter  fact  makes  the  proposed  method  particalarly  suitable  for  singular  configurations. 

Key  words:  Constrained  multibody  systema  penalty  and  augmented  Lagrangian  method,  holonomic  and  non- 
holonomic  constraints,  canonical  equations  of  Hamilton. 


1.  Review  of  Current  Approaches  for  Multibody  Dynamics 

Computer  systems,  while  increasing  tremendously  in  power  in  recent  years,  are  so  affordable 
now^ays,  that  their  use  have  become  widely  spread  in  many  different  fields  and  for  a 
number  of  applications.  The  computer  kinematic  and  dynamic  analysis  of  multibody  systems 
is  increasingly  being  used  in  fields  such  as  the  automobile  industry,  aerospace,  robotics, 
machinery,  biomechanics,  etc.,  and  it  has  been  receiving  considerable  attention  recently, 
as  seen  by  the  amount  of  literature  on  multibody  simulation  and  computer  aided  analysis 
programs  being  sold  in  the  market  of  engineering  software.  Nevertheless,  there  is  an  iiKreasing 
demand  for  faster  and  more  reliable  simulations  that  must  be  based  on  more  efficient  aiKl  robust 
algorithms  for  multibody  dynamics. 

The  dynamic  analysis  of  multibody  systems  is  a  process  which  is  most  appropriately 
performed  using  interactive  (rather  than  batch)  type  of  analysis.  The  analyst  is  interested  in 
visualizing  a  whole  set  of  successive  responses  of  the  multibody,  a  simulation  of  its  behavior 
and  operation  over  all  the  mechanism  workspace  and  over  a  certain  period  of  time.  In  certain 
cases  it  may  be  even  necessary  to  introduce  the  engineer  as  an  additional  element  in  the 
simulation,  called  “man-in-the-loop”,  who  may  act  by  introducing  external  forces  or  control 
over  specific  degrees  of  freedom.  In  any  case,  each  response  over  a  time  step  needs  to  be 
calculated  and  displayed  at  the  highest  speed  possible  in  order  to  give  a  picture  that  will 
possibly  resemble  the  actual  motion  of  the  system  in  both  time  and  space:  the  real  time 
behavior. 

While  it  is  important  for  multibody  dynamic  simulations  to  have  fast  and  accurate  inter- 
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active  graphical  interfaces,  it  is  essential  that  the  computer  software  relies  on  good  numerical 
algorithms  that  will  permit  a  fast  and  reliable  solution  of  the  resulting  algebraically  constrained 
differential  equations  of  motion.  Consequently,  it  becomes  very  important  that  numerical 
efficiency  and  stability  be  combined  with  robustness  so  that  the  simulation  does  not  reach 
dead-lock  situations  due  to  singularity  positions  in  the  multibody  motion. 

In  recent  years,  some  important  advances  have  been  made  in  the  development  of  new 
formulations  for  multibody  dynamics.  Some  formulations,  stemming  mainly  from  the  robotics 
field,  have  been  especially  conceived  for  real-time  simulation  and  are  based  on  the  recursive 
computation  of  some  or  all  of  the  terms  in  the  equations  of  motion  [1-4].  Some  of  these 
algorithms  are  0{N)  meaning  that  the  number  of  floating  point  arithmetic  operations  grows 
linearly  with  the  number  of  degrees  of  freedom.  Others  require  the  solution  of  a  system  of 
N  linear  equations  and,  therefore,  are  of  order  0{N^)  (if  Gaussian  elimination  is  used). 
Although  it  has  been  demonstrated  by  Featheistone  [3]  that  the  best  0{N^)  algorithms  are 
faster  than  the  best  0{N)  algorithms  for  N  <  10,  the  elegance  and  attractiveness  of  the 
0{N)  Featherstone’s  formulation  has  exerted  a  strong  influence  on  later  developments  that 
have  generalized  these  ideas  for  non-serial  (tree-configuration)  and  closed-loop  systems  [6-7]. 
A  limitation  arises  when  closed-chain  multibodies  are  analyzed,  since  for  these  cases  special 
provisions  must  be  made  to  account  for  the  reaction  forces  between  the  different  loops  [6]. 

The  second  group  of  methods  encompasses  those  that  reduce  the  equations  of  motion  in 
dependent  coordinates  to  a  minimum  set  of  independent  ones  via  a  transformation  matrix 
obtained  from  the  nullspace  of  the  Jacobian  of  tli»  constraint  equations.  Different  methods 
of  choosing  the  independent  set  of  coordinates  and  generating  the  transformation  matrix 
have  been  proposed  [8-14].  The  concept  of  velocity  transformations^  initially  introduced  by 
Jerkovsky  [  15],  has  been  subsequently  extended  into  efficient  algorithms  [16-19]  that  avoid  the 
Jacobian  factorization,  arid  allow  for  an  efficient  and  simple  way  of  generating  the  equations 
of  motion  in  independent  coordinates  in  a  way  that  can  be  fully  parallelized  [19]. 

The  classical  way  to  generate  the  equations  of  motion  is  to  use  dependent  (or  absolute) 
coordinates  to  generate  and  solve  the  equations  of  motion  [14].  These  algorithms  are  based 
on  the  classical  Lagrange’s  formulation  which  leads  to  a  set  of  differential  and  algebraic 
equations  (DAE)  of  motion  with  the  coordinates  and  multipliers  as  unknowns.  The  solution  of 
these  equations  require  special  techniques  [20]  whose  merit  has  not  been  thoroughly  calibrated 
yet  for  the  integration  of  multibody  systems.  A  way  to  avoid  the  DAE  is  by  differentiating 
the  constraints.  The  resulting  constraint  violations  are  commonly  stabilized  using  the  method 
proposed  by  Baumgarte  [21].  An  extension  for  violation  stabilization  of  holonomic  systems 
based  on  the  use  of  the  canonical  momenta  has  been  proposed  in  [22]. 

Although  the  methods  described  above  are  well  established  (some  of  them  very  efficient 
numerically),  they  can  not  directly  handle  redundant  constraints.  In  fact  these  have  to  be 
eliminated  prior  to  the  dynamic  analysis.  In  addition  they  all  fail  to  give  successful  solutions 
when  the  multibody  undergoes  a  singular  position.  A  partial  solution  to  the  problem  of  singular 
positions  was  provided  in  [23]  where  a  method  is  developed  that  detects  the  ill-conditioning  of 
the  Jacobian  matrix  so  that  the  integrator  can  step  over  it  In  [24-25]  a  regularization  method 
is  proposed  to  cope  with  singularities.  The  main  idea  consists  in  adding  to  the  vanishing  and 
the  linearly  dependent  constraints  their  third  derivatives,  and  this  turns  the  Jacobian  non- 
singular.  A  staggered  stabilization  approach  was  presented  by  Park  and  Chiou  [26]  which 
was  later  refined  by  means  of  an  explicit-implicit  integration  procedure  [27].  This  method 
integrates  two  different  sets  of  equations  one  for  the  coordinates  and  another  for  the  Lagrange 
multipliers,  and  avoids  the  singular  position  problem  of  the  equations  of  motion.  A  small 
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limitation,  however,  is  that  it  requires  the  inversion  of  the  mass  matrix,  which  is  in  general 
semi-positive  definite  and  may  not  have  an  inverse  in  certain  instances  (in  particular  when 
redundant  dependent  coordinates  are  used). 

Bayo,  Garcia  de  Jalon  and  Serna  proposed  a  penalty  method  [28]  by  which  the  acceleration, 
velocity  and  position  constraint  conditions  are  added  to  the  equations  of  motion  as  a  “dynami¬ 
cal  penalty  system”  to  obtain  a  simple  and  efficient  formulation  for  the  dynamic  equations.  The 
appeal  of  this  formulation  lies  in  two  main  points.  Firstly,  it  leads  to  a  reduced  set  of  equations 
in  the  form  y  =  g(y,  0  can  be  integrated  by  standard  conditionally  st^le  numerical 
algorithms,  without  the  need  of  further  stabilization  techniques  to  control  the  violation  of  the 
constraints  during  the  integration  process.  Secondly,  unlike  the  classical  methods  which  rely 
on  the  Jacobian,  this  penalty  formulation  leads  to  matrices  that  can  be  inverted  even  in  singular 
positions,  and  in  the  presence  of  redundant  (liitearly  dependent)  constraints  and  coordinates. 
Important  theoretical  studies  of  its  convergence  and  stability  have  been  carried  out  in  [29]  and 
[30].  The  penalty  method  of  [28]  has  also  been  successfully  extended  to  real  time  dynamics 
within  the  context  of  fully  Cartesian  coordinates  in  [31].  There,  it  has  been  shown  that  the 
penalty  method  requires  the  factorization  of  a  symmetric  matrix  that  is  dominated  by  the  terms 
in  the  main  diagonal  (no  pivoting  is  required),  and  is  strongly  banded,  feature  that  makes  it 
an  order  n  method,  where  n  is  the  number  of  coordinates.  In  addition,  the  different  steps  of 
the  algorithm  can  be  parallelized,  making  this  method  suitable  for  very  large  systems. 

It  was  also  proposed  in  [28]  a  more  complete  and  accurate  augmented  Lagrangian  method 
(combination  of  the  penalty  formulation  and  Lagrange’s  multipliers),  which  allows  for  con¬ 
vergence  independently  of  the  penalty  values  and  which  yields  the  constraint  forces  (Lagrange 
multipliers)  as  a  by-product  without  having  to  integrate  additional  equations.  In  this  paper  we 
examine  this  augmented  Lagrangian  formulation  within  the  context  of  singular  positions  and. 
in  addition,  propose  a  new  one  based  on  the  use  of  the  canonical  equations  of  Hamilton  that 
is  even  more  stable  and  numerically  efficient  than  the  previous  one. 


2.  Preliminaries  on  the  Classical  Formulations 

2.1.  Acceleration  Based  Lagrange’s  Multiplier  Formulation 


Let  us  consider  a  multibody  system  whose  configuration  is  characterized  by  n  generalized 
coordinates  q  that  are  interrelated  through  the  m  holonomic  kinematic  constraint  cotKlitions 

*(q,t)  =  0  (1) 


Let  £.  be  the  system  Lagrangian,  defined  by  C  =  T  -  V  ,  where  T  and  V  are  the  kinetic 
and  potential  energy,  respectively;  and  let  Q  be  the  vector  of  external  and  non-conservative 
forces.  The  Lagrange  equations  of  such  a  system  can  be  written  as  [32] 


1 

dt  Vaq  j 


dL  ^ 


(2) 


which  for  a  general  multibody  system  leads  to: 
Mq  +  *5 A  =  Q  -»-  Lq  -  Mq, 


(3) 


where  M  is  the  mass  matrix,  £q  is  the  partial  derivative  of  the  Lagrangian  with  respect  to 
the  coordinates,  4q  is  the  Jacobian  of  the  constraint  equations,  Q  is  the  vector  of  external 
and  non-conservative  forces,  and  A  is  the  vector  that  contains  the  Lagrange's  multipliers. 


4  E.  Bayo  and  A.  Avello 


Equations  (1)  and  (3)  constitute  a  set  of  n  +  m  mixed  differential  algebraic  equations  (DAE) 
of  index  three  [20],  with  q  and  A  as  unknowns.  In  order  to  avoid  the  direct  integration  of 
DAEs,  a  double  differentiation  of  the  constraints  equations  may  be  carried  out,  which  along 
with  the  Baumgarte’s  stabilization  [21]  yields; 


M  rql^f  Q  +  Iq-Mq  1 

0  J  I  ^  .1  [  -^qq  -  -  a*  -  6$  J 


(4) 


where  a  and  b  are  the  stabilization  constants.  These  equations  can  now  be  integrated  using 
standard  numerical  integrators  [33]  with  each  function  evaluation  performed  using  equation 
(4). 


2.2.  Lagrange’S  multiplier  Formulation  in  Canonical  Form 


The  definition  of  the  conjugate  or  canonical  momenta  can  be  taken  from  classical  mechanics 
[32] 


along  vrith  the  Hamiltonian 

H  =  p^q  -  L.  (6) 


The  canonical  equations  of  Hamilton  for  a  constrained  system  are  formulated  as 


-‘■= 


A. 


(7a) 

(7b) 


In  the  case  of  multibody  systems  the  Lagrangian  L  is  defined  in  terms  of  q,  q  and  t,  and  rather 
than  following  a  lengthy  process  to  form  the  Hamiltonian  as  an  explicit  function  of  q,  p  and 
t,  and  then  differentiate  as  in  (7a),  the  canonical  equations  can  be  directly  obtained  from  (5) 
and  (7b).  Since  the  system  kinetic  energy  is  a  quadratic  function  of  the  generalized  velocities, 
(5)  and  (7b)  directly  lead  to  the  following  set  of  equations  in  matrix  form 


p  =  Mq 

(8a) 

p  =  Lq  +  Q-*5A. 

(8b) 

The  combination  of  (8a-b)  and  (1)  constitutes  a  system  of  2n  -f  m  differential  and  algebraic 
equations  (DAE),  of  index  two.  Note,  that  although  equations  (8a-b)  have  n  more  equations 
than  (3),  p  can  be  obtained  explicitly  from  (8b).  In  addition,  index  two  DAEs  are  better 
behaved  than  index  three  DAEs  [20],  and  therefore  the  consideration  of  equations  (8a-b)  may 
be  numerically  more  advantageous  than  (3),  when  using  algorithms  for  the  solution  of  the 
mixed  differential  algebraic  equations. 

In  order  to  avoid  the  mixed  differential  and  algebraic  equations,  the  system  Lagrangian  is 
modified  in  [22]  and  [34]  to  include  the  kinematic  velocity  constraints  as 

L*  =  L  +  4^(7, 


(9) 
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whe?«  cr  arc  the  new  Lagrange  multipliers.  The  new  Hamiltonian  is  H  =  p^q  -  L*  and  the 
application  of  (3)  and  (5b)  leads  to 

Pnet.,  =  M  q  +  *  J<T  (10a) 

Pn^u,  =  Iq  +  Q  +  (10b) 

that  along  with 

^ +7^  =  $qq-f- +7^  =  0  (11) 

constitutes  a  set  of  2n+m  ordinary  differential  equations  (ODE),  with  p,  q  and  o  as  unknowns. 
The  real  constant  7  provides  asymptotic  stability  of  the  stabilization  scheme.  It  can  be  very 
easily  verified  by  differentiation  of  (10a)  and  substitution  in  (lOb)  that  &  =  X. 

It  is  worth  mentioning  that  only  the  following  n + m  equations  need  be  solved  at  each  time 
step  in  the  numerical  implementation  of  the  algorithm; 

r  M  « J  1  f  q  1  f  Pnew  1 


— —  7^ 


The  numerical  simulations  of  [34]  show  that  since  only  the  first  time  derivative  of  the  con¬ 
straints  is  used,  the  integration  of  this  equations  is  more  efficient  and  more  stable,  than  the 
acceleration  based  counterparts. 


2.3.  reduction  TO  AN  INDEPENDENT  Set  OF  COORDINATES 

The  other  widely  accepted  group  of  methods  for  multibody  dynamics  is  based  on  the  use 
of  a  transformation  matrix  R  that  will  reduce  the  equations  of  motion  to  a  minimum  set  of 
coordinates.  The  matrix  R  is  obtained  from  the  concept  of  the  nullspace  of  the  Jacobian, 
and  allows  one  to  express  equation  (3)  in  terms  of  an  independent  set  of  coordinates.  The 
procedure  starts  by  dififerentiating  the  constraint  equations  4^(q)  =  0.  that  for  simplification 
purposes  we  assume  are  sclerenomous,  to  obtain 

*qq  =  0.  (13) 

It  may  be  seen  from  equation  (13)  that  q  belongs  to  the  nullspace  of  the  Jacobian  ^q.  The 
dimension  of  the  nullspace  is  equal  to  /,  where  /  is  the  number  of  degrees  of  freedom  of 
the  multibody  system.  We  can  always  express  q  as  a  linear  combination  of  the  vectors  of  a 
nullspace ’s  basis,  in  the  form 

q  =  Hi,  (14) 

where  R  is  an  rt  X  /  matrix  whose  columns  constitute  a  basis  of  the  nullspace,  and  z  are 
the  /  independent  velocities.  SiiKC  R  constimtes  a  bas's  of  the  nullspace  of  the  Jacobian, 
it  satisfies  the  relationship  ^qR  =  0.  The  matrix  R  may  be  obtained  from  the  Jacobian  by 
projection  methods  using  Gauss  factorization  [8],  the  singular  ^alue  decomposition  [9]  or  the 
QR  method  [10].  It  can  also  be  obtained  more  efficiently  by  velocity  transformations  [15-19]. 
The  substitution  of  (14)  into  (3)  and  premultiplication  by  R^  yields: 

R^M  R  z  =  R^(Q  -F  Iq  -  Mq)  -  R^M  R  z  (15) 

from  which  z  can  be  calculated.  An  extension  of  this  method  within  the  setting  of  canonical 
equations  has  been  proposed  in  [35],  where  the  same  leading  matrix  is  obtained. 
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Fig.  2.  i.  Slider-crank  mechanism  motion,  b.  Rotating  bar  -notion. 


2.4.  Why  the  Classical  Formulations  Fail  in  Singular  Positions 

As  mentioned  before,  a  singular  position  is  encountered  when  the  multibody  reaches  a  config¬ 
uration  in  which  there  is  a  sudden  change  in  the  number  of  degrees  of  freedom.  For  instance, 
a  slider-crank  mechanism  as  the  one  shown  in  Figure  1,  reaches  a  singular  position  when 
the  two  links  are  in  vertical  position.  In  that  configuration,  both  links  are  coincident  and  the 
mechanism  has  not  one  but  two  degrees  of  freedom.  These  two  degrees  of  freedom  corre¬ 
spond  to  the  two  possible  motions  (bifurcations)  that  the  mechanism  can  undergo,  and  which 
are  illustrated  in  Figure  2.  Figure  2a  shows  the  first  possible  motion  that  corresponds  to  a 
slider-crank  mechanism.  Figure  2b,  shows  the  second  motion  corresponding  to  a  rotating  bar 
(in  fact  two  coincident  rotating  bars).  As  may  be  seen,  a  singular  position  implies  a  bifurcation 
poinL  in  which  the  mechanism  can,  at  least  theoretically,  undergo  different  paths. 

The  existence  of  a  singular  position  with  both,  the  classical  Lagrange’s  multipliers  approach 
and  the  reduction  to  a  set  of  itKlependent  coordinates,  is  invariably  detected  when  the  Jaco¬ 
bian  matrix  of  the  constraints  be^mes  rank  deficient  These  formulations  are  based  on  the 
decomposition  of  the  Jacobian  and  since  its  rank  suddenly  falls  at  a  singular  position,  the 
decomposition  fails  and  therefore  no  solution  can  be  found.  The  simulation  then  crashes  not 
because  of  the  physics  of  the  problem,  but  because  of  the  inability  of  the  dynamic  formulation 
to  overcome  the  sudden  change  in  the  rank  of  the  Jacobian. 

Equations  (4)  and  (12)  are  the  key  equations  for  the  solution  of  the  dynamics  using  the 
Lagrange’s  multipliers  method.  Assuming  that  all  the  constraints  are  independent,  that  is  T 
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m  —  n  -  f,lhe  rank  of  the  leading  matrices  in  those  equations  is  n  4-  m.  Since  the  Jacobian 
becomes  rank  deficient  in  singular  positions,  this  matrix  becomes  singular.  This  means  that  the 
accelerations  (or  velocities)  may  not  be  computed  unless  special  cate  is  taken  to  eliminate  (or 
regularize),  at  that  paitiailar  position  of  the  multibody  system,  all  the  vanishing  constraints. 
Otherwise,  the  dynamic  simulation  crashes  at  this  point  Equation  (15)  is  the  alternate  key 
equation  for  the  independent  cootdinate  method.  Again,  when  a  singular  position  is  reached 
special  provisions  have  to  be  made  for  the  computation  of  the  matrix  R. 

If  a  singular  position  is  not  exactly  reached,  the  leading  matrix  of  both  classical  methods 
will  not  be  strictly  singular,  but  near  singular,  with  a  very  high  condition  number.  If  this 
situation  is  not  correctly  tracked,  the  integration  and  round-off  errors  will  be  amplified  and 
the  resulting  solutions  may  be  totally  erroneous. 

It  is  important  at  this  stage  to  emphasize  the  difference  between  a  singular  Jacobian  and  a 
singular  position.  While  a  singular  position  always  implies  a  singular  Jacobian,  the  converse 
is  not  always  true.  A  Jacobian  can  become  singular  when  redundant  constraints  are  present, 
a  dead-lock  position  is  reached,  or,  when  the  coordinate  partitioning  between  dependent  and 
independent  coordinates  is  not  made  properly  or  has  not  been  updated  for  a  while.  Qintrary 
to  the  case  of  a  singular  position,  these  singularities  can  be  easily  avoided  and  the  simulation 
may  proceed  smoothly.  The  difference  between  singular  Jacobian  and  singular  positions  can 
be  better  understood  by  partitioning  the  columns  of  the  Jacobian  #q  into  two  submatnees 
and  ^q*,  corresponding  to  the  dependent  and  independent  coordinates,  respectively. 
This  partition  is  made  so  that  ^q**  has  fuU  row  rank.  When  ^q'*  is  rank-deficient  but  ^q  has 
fuU  row  rank  the  singularity  is  easily  avoidable  since  the  full  rank  of  ^q*^  can  be  recovered 
by  a  new  suitable  choice  of  independent  coordinates.  However,  at  a  singular  position  ^q 
looses  rank  all  of  a  sudden,  and  the  singularity  may  only  be  avoided  by  eliminating  the  non¬ 
active  constraints.  As  pointed  out  in  [25]  if  these  non-active  constraints  are  eliminated  in  the 
neighborhood  of  the  singular  configurations  the  corresponding  constraint  forces  become  zero 
and  this  may  result  in  a  fast  deviation  of  the  simulation  from  the  constrained  behavior. 


3.  Acceleration  Based  Augmented  Lagrangian  Formulation 

3.1.  Description  OF  THE  METHOD 

For  the  sake  of  completeness  and  in  order  to  facilitate  the  understanding  of  the  methods 
proposed  in  sections  4  and  6,  we  present  in  this  section  the  augmented  Lagrangian  method 
introduced  in  [28].  Later  we  will  address  its  behavior  in  singular  positions.  Given  a  multibody 
system  with  holonomic  constraint  equations  of  the  form  given  in  (1),  which  represent  a  set  of 
nonlinear  algebraic  equations  in  the  coordinates  and  the  time  variable.  The  penalty-augmented 
Lagrangian  formulation  proposed  in  [28]  is  derived  by  adding  to  the  Lagrangian  two  terms:  a 
fictitious  potential 

V*  = 

k  ^  ^ 

and  a  fictitious  kinematic  energy  term 


=  =2*  “*■ 


T 


(17) 
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A  set  of  Rayleigh’s  dissipative  forces  is  also  added  to  the  system 
Gk  =  -2ak  Wfc  =  -2oi  Cl/ii, 


(18) 


where  at  are  large  positive  real  values  (penalty  numbers),  and  Wk  and  fik  represent  the  natural 
frequency  and  the  damping  ratio  of  the  dynamic  penalty  system  (mass,  dashpot  and  spring) 
corresponding  to  the  constraint  ^k  =  0.  Matrices  a,  H  and  /i  are  m  x  m  diagonal  matrices 
that  contain  the  values  of  the  penalty  numbers,  the  natural  frequencies  and  the  damping  ratios 
of  the  penalty  systems  assigned  to  each  constraint  condition.  If  the  same  values  are  u^  for 
each  constraint  these  matrices  become  identity  matrices  multiplied  by  the  respective  penalty 
numbers.  Note  that  in  equations  (16)  through  (18)  we  have  used  both  index  as  well  as  matrix 
notation,  hoping  that  this  will  lead  to  a  better  understanding  of  the  physical  significance  of 
the  different  terms.  In  the  following  discussion  we  will  only  use  the  matrix  form  in  order  to 
be  consistent  with  the  notation  used  so  far  in  the  paper. 

The  differentiation  of  the  new  Lagrangian  lea^  to 


^  =  Lq  + 

dL^  .  iT  A 
=  M  q  +  9 


-( 

dt  V 


dL* 

d<i 


)=Mq  +  Mq  +  ija  «  +  4, 


(19) 


(20) 


(21) 


where  the  relation  which  can  be  easily  verified,  has  been  used.  I  is  the  Lagrangian 

corresponding  to  the  system  without  constraints. 

The  work  done  by  the  fictitious  Rayleigh  forces  is 

SWr  =  -2(6^  fa  n  M  *  =  -2^q^*Ja  n  fi  9.  (22) 

Therefore  the  final  expression  obtained  by  the  application  of  the  Lagrange’s  equations  (3)  is 

Mq  +  a  A*  =  Q  +  Lq  -  Mq,  (23) 

where  A*  are  the  new  Lagrange  multipliers  of  the  modified  system.  Note  that  the  second  term 
in  the  LHS  of  equation  (23X  represents  the  projection  in  the  direction  of  the  coordinates  q  of 
^1  the  internal  forces  that  are  generated  by  the  penalty  system  when  the  constraints  #,  4  and 
9  are  violated.  Introducing  4  =  4^qq  -I-  #qq  +  the  final  result  is  obtained 

(M  +  *Ja*q)q  +  *jA* 

=  Q  +  Lq  -  Mq  —  ♦q  a  (^qq  +  ♦,  +  29fi9  +  .  (24) 

This  equation  may  be  viewed  as  the  “generic  penalty  method’’  [28]  to  which  the  Lagrange’s 
multipliers  are  added.  As  clearly  shown  in  [28]  this  augmented  Lagrangian  formulation  allows 
the  analyst  to  choose  from  a  wide  range  of  penalty  values  that  assure  convergence  and  avoid 
numerical  ill-conditioning.  As  we  will  see  later,  the  solution  provided  by  this  method  is  not 
sensitive  to  the  value  taken  by  the  penalty  factor,  and  therefore,  equation  (24)  represents  and 
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elegant  and  attractive  way  of  avoiding  the  problems  customarily  attributed  to  the  penalty 
formulations. 

It  is  important  to  note  that  there  is  a  very  important  difference  between  equation  (24) 
and  the  classical  dynamic  algorithms  represented  by  (4)  and  (15).  As  we  indicated  before, 
the  leading  matrices  of  the  latter  equations  become  singular  in  singular  positions.  However, 
although  the  mass  matrix  M  is  in  general  positive  semi-definite,  it  is  always  strictly  positive 
definite  in  the  nullspace  of  the  Jacobian  matrix.  Therefore,  a  look  at  equation  (24)  reveals  that 
its  leading  matrix  (M  -f-  ^q^a^q)  is  always  positive  definite,  which  meaits  that  it  can  always 
be  factorized,  even  in  singular  positions  and/or  with  linearly  dependent  constraints.  In  practice, 
the  augmented  Lagrangian  formulation  is  superior  to  the  generic  penalty  method  since  the 
former  allows  for  smaller  values  of  the  penalty  parameter,  hence,  for  a  better  conditioning  of 
the  leading  matrix. 

In  equation  (24)  the  Lagrange’s  multipliers  A*  play  the  role  of  correcting  terms.  In  the 
limit  the  constraint  conditions  are  satisfied,  thus  A  =  A*  and  equations  (4)  and  (24)  become 
^uivalent  except  for  round  off  errors  induced  by  the  penalty  parameter  and  finite  machine 
precision.  By  comparing  those  two  equations  one  can  infer  that 

A  s  A*  4- «  (*  -I-  2n/i*  +  n^*).  (25) 

We  are  seeking  the  solution  of  (24)  without  having  to  use  the  algebraic  constraint  equations 
(1).  This  requires  that  the  correct  values  of  A*  be  knov^m  so  that  they  can  be  inserted  in  (24). 
Since  those  values  are  not  known  in  advance  we  need  to  set  up  an  iterative  proce.:s  that 
calculates  the  unknown  multiplier  A*.  The  iteration  is  easily  established  by  taking  advantage 
of  equation  (25) 

A*+i  =  A*  +  4-  20/4*  4-  *  =  0, 1,2,...  (26) 

with  Ag  =  0  for  the  first  iteration.  Equation  (26)  physically  represents  the  introduction  at 
iteration  t  4- 1  of  forces  that  tend  to  compensate  the  fact  that  the  addition  of  all  the  constraint 
terms  are  not  exactly  zero.  It  turns  out  that  with  the  augmented  Lagrangian  formulation,  the 
penalty  numbers  do  not  need  to  be  very  large  (thus  leading  to  a  better  numerical  conditioning) 
since  the  resulting  error  in  the  constraint  equations  will  be  eliminated  by  the  Lagrange's  terms 
during  the  iteration  procedure.  Also  note  that  the  "generic  penalty”  [28]  method  corresponds 
to  the  augmented  Lagrangian  formulation  in  which  the  iteration  process  is  only  carried  out 
once. 

The  matrix  formulation  of  (24),  including  the  iterative  process  defined  in  (26),  is  given  by 
the  following  expression: 

4-  *q  a*q)  =  Mq^  -  *q  a  (*qq  4-  4-  20/4*  4-  O**^ 

t  =  0,1,2,...  (27) 

where  the  subscript  i  represents  the  iteration  number,  and  M  qo  =  Q+Lq  -  Mq  for  the 
initial  iteration.  Equation  (27)  may  be  used  to  iterate  until  Hq^*'*'*^  -  q^‘^  |I  <  e,  where  e  is  a 
user-specified  tolerance. 

The  main  advantage  of  using  equation  (27)  is  that  the  penalty  terms  are  in  fact  used  as  an 
intermediate  tool  in  order  to  compute  the  Lagrange’s  multipliers  for  which  no  new  equations 
are  integrated:  only  n  equations  are  solved  in  the  integration  process.  Therefore,  the  value  of 
the  penalty  factor  a  docs  not  affect  the  solution,  but  only  the  convergence  rate.  Experience 
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shows  that  when  the  constraints  are  scaled  to  unity,  penalty  factors  ranging  from  10^  to  10^ 
give  a  good  convergence  rate,  and  only  2  to  4  iterations  are  required  to  converge  to  the  machine 
precision,  in  double  precision  arithmetic. 

Note  that  the  added  cost  of  using  equation  (27)  to  refine  the  solution  and  obtain  the 
Lagrange  multipliers  is  fairiy  small,  since  its  leading  matrix  remains  constant  during  the 
iteration  process  needed  for  a  function  evaluation.  Therefore,  at  each  iteration  step  only  the 
computation  of  the  independent  tenn  and  a  forward  and  a  backward  substitutions  are  required. 
The  numerical  implementation  of  the  algorithm  using  standard  integrators  [33],  available  in 
commercial  mathematical  libraries,  is  rather  simple  and  may  be  described  as  follows 

ALGORITHM  ALFl 
Given  q  and  q  at  time  step  1. 

1.  Use  (27)  iteratively  to  solve  for  q,  with  M  qo  =  Q  +  -  Mq  for  the  initial  iteration. 

*  At  the  end  of  each  iteration  use  (26)  to  calculate  the  Lagrange  multipliers  A*,  if  desired 

2.  Call  the  numerical  integration  subroutine  (n.i.s)  to  compute  q  and  q  at  time  step  1  +  1. 

3.  Upon  convergence  of  the  n.i.s  update  the  time  variable  and  go  to  step  1. 

We  have  used  this  algorithm  very  successfully  in  multibody  dynamics  simulation  and  has 
turned  out  to  be  very  efficient  and  accurate.  However,  we  have  noticed  that  under  repetitive 
singular  conditions  this  algorithm  may  lead  to  unstable  behavior  (see  examples  below)  due  to 
the  accumulation  of  small  violations  of  the  constraints  during  the  integration  process.  This  lead 
us  to  propose  a  more  robust  augmented  Lagrangian  method  based  on  the  carmnical  equations 
of  Hamilton  that  is  presertted  in  the  next  section. 

4.  Augmented  Lagrangian  Formulation  in  Canonical  Form 

4.1.  Basic  Augmented  Lagrangian  Formulation  in  Canonical  Form 

Let  us  consider  equation  (9)  as  the  starting  point  to  build  a  modified  Lagrangian  that  will  not 
only  contain  the  Lagrange  multipliers  a  but  also  the  penalty  terms  of  the  previous  section. 
Accordingly: 

=  L  +  +  (28) 

In  the  limit  when  the  constraint  conditions  are  satisfied,  the  penalty  terms  vanish  and  a  =  a*. 
Similar  to  the  L^range’s  formulation  a*  =  A*  and  after  the  augmented  Lagrangian  iteration 
when  the  constraints  are  satisfied  to  machine  precision  a  =  X.  The  differentiation  of  L*  with 
respect  to  q  leads  to  the  following  new  canonical  momenta  in  matrix  form 

p  =  ^  =  Mq  +  *Ja*  +  *J<r*,  (29) 

where  we  have  eliminated  the  subindex  ‘rww’  of  (10a)  for  practical  convenience.  The  modified 
Hamiltonian  can  be  written  as  W*  =  p^q  -  L*  and  the  use  of  (7),  including  the  Rayleigh 
forces  of  (18),  leads  to 

a#qj  q  =  p  -  ^q  a*t  +  ^qCr* 


(30a) 
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P=Q  +  Lq  +  +  ijtr*.  (30b) 

Equations  (30a-b)  constitute  a  set  of  2n  first  order  ordinary  differential  equations.  However, 
p  is  given  in  explicit  fonn,  and  therefore  only  n  algebraic  equations  need  be  solved  at  each 
function  evaluation  for  the  numerical  implementation  of  the  algorithm. 

Our  numerical  simulations  have  shown  that  equations  (30)  tend  to  be  numerically  stiff  due 
to  all  the  penalty  terms  concentrated  in  the  RHS  of  (30b).  This  numerical  stifhiess  limits  the 
possible  choices  of  numerical  integrators.  Stwdard  ODE  integrators  [33]  that  are  based  on 
conditionally  stable  predictor-corrector  multi-step  formulae,  lead  to  an  increased  number  of 
function  evaluations.  We  propose  in  the  next  section  a  modification  of  (30)  that  circumvents 
this  problem. 


4.2.  Modified  augmented  Lagrangian  Formulation  in  Canonical  Form 
The  canonical  equation  (30a)  may  be  also  written  as 

P  =  (31) 

which  indicates  that  the  canonical  momenta  is  stabilized  through  the  addition  of  penalty 
terms  that  are  proportional  to  the  violation  of  the  velocity  constraint  equations.  It  is  important 
to  realize  that  if  equation  (31)  is  differentiated  and  substituted  into  the  acceleration  based 
augmented  Lagrangian  equation  (24)  the  result  is  precisely  the  additional  canonical  equation 
(30b),  which  lead  us  to  see  that  the  canonical  equations  originate  from  the  acceleration  based 
equations  by  the  mere  canonical  tranrformation  indicated  in  (31). 

However,  we  can  achieve  a  better  stabilization  of  the  canonical  momenta  if  we  add  to  the 
RHS  of  (31)  two  additional  penalty  terms:  one  term  proportional  to  the  constraint  violation 
and  the  other  to  its  integral.  Accordingly  we  define  a  new  momenta  p  as 


p  =  Mq  -I-  -F  2pn«  A- -  * 


By  expanding  the  term  #  equation  (32)  becomes: 


(32) 


(m  +  q  =  p  -  -F  2^0*  +  O*  ^dtj  -  #  Ja*.  (33a) 

The  differentiation  of  (33a)  and  substitution  into  (24)  leads  to  the  second  set  of  modified 
canonical  equations 


P  =  Q  -F  f:,  -F  -F  -F  (33b) 

which  along  with  (33a)  constitute  a  set  of  2n  first  order  ordinary  differential  equations  in  the 
unknowns  p,  q  and  a*.  Again  only  n  algebraic  equations  need  be  solved  at  each  hmetion 
evaluation  for  the  numerical  implementation  of  the  algorithm.  A  very  important  point  is  that, 
contrary  to  equations  (30a-b),  equations  (33a-b)  do  not  become  stiff,  and  all  our  numerical 
experiments  show  that  they  even  provide  more  numerical  accuracy  and  better  constraint 
stabilization  than  the  acceleration  based  formul^on  of  equation  (24). 

In  fact  we  can  compare  this  set  of  equations  with  the  n  second  order  ordinary  differential 
equations  resulting  from  the  acceleration  based  formulation  of  (24).  While  both  formulations 
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require  the  iriangularization  of  the  same  leading  matrix  for  each  fimetion  evaluation,  there 
is  a  serious  advantages  in  the  use  of  (33a-b)  as  compared  to  (24):  the  kinematic  constraint 
conditions  are  differentiated  only  once  with  the  canonical  procedure  (twice  in  the  acceleration 
based  formulation)  and  this  will  lead  to  lesser  violations  of  the  constraints.  We  will  see  in  the 
numerical  simulations  of  Section  6.  how  this  factor  becomes  detrimental  for  the  acceleration 
based  fonnulation  under  repetitive  singular  positions,  whereas  the  canonical  approach  leads 
to  a  much  better  performance. 

Note  again,  that  the  multipliers  a*  do  not  iteed  to  be  solved  for  explicitly.  Following  the 
same  procedure  as  that  used  with  the  acceleration  based  augmented  Lagrangian  formulation, 
the  cr*  may  be  obtained  in  an  iterative  manner  as: 

+  +  ,  i=0,l,2,...  (34) 

with  (Tq  =  0  for  the  first  iteration.  Equation  (33a)  including  the  iterative  process  of  (34) 
becomes 

(m  +  a#q)  qj+i  =  a  +  2/in*  j  j  > 

i  =  0,1,2,...  (35) 

with  M  qo  =  p  for  the  first  iteration.  Equation  (35)  shows  that  the  velocity  calculation  at 
each  function  evaluation  is  refined  so  that  the  weighted  summation  of  the  constraint  equations 
(34)  are  satisfied  to  machine  precision.  After  the  velocity  calculation  equation  (33b)  may  be 
used  to  evaluate  the  derivative  of  the  canonical  momenta. 

The  algorithm  may  be  presented  as 

ALGORITHM  ALF2 
Given  p  and  q  at  time  step  1. 

1.  Use  (35)  itcrativciy  to  solve  for  q,  with  M  q©  =  p  for  the  first  iteration.  At  the  end  of 
each  iteration  use  (34)  to  calculate  the  Lagrange  multipliers  a* 

2.  Use  (33b)  to  compute  p  explicitly  (no  solution  of  equations  involved). 

3.  Call  the  numerical  integration  subroutine  to  compute  p  and  q  at  time  step  1  +  1. 

4.  Upon  convergence  of  the  n.i.s. 

•  If  desired,  use  a  differentiation  scheme  to  obtain  \  =  & 

•  Update  the  time  variable  and  go  to  step  1. 

This  algorithm  is  as  efficient  numerically  as  ALFl  but  much  more  stable  under  repetitive 
singular  positions. 

5.  Canonical  Augmented  Lagrangian  Formulation  for  Non-Holonomic  Systems 

The  modified  augmented  Lagrangian  fonnulation  described  above  may  also  be  extended  to 
non-holonomic  systems  with  coikstraints  of  the  form 

*(q,q,t)  =  0.  (36) 

The  acceleration  based  augmented  Lagrangian  formulation  for  this  type  of  constraints  is: 

Mq  =  Q  +  Lq-Mq-*Ja(*  +  ;3*) (37) 
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In  order  to  obtain  the  canonical  counterparts  we  follow  a  procedure  similar  to  that  used  for 
the  holonomic  case,  and  establish  the  following  canonical  transformation: 


p  =  Mq  +  0  j 


(38a) 


which  indicates  that  a  better  stabilization  of  the  canonical  momenta  may  be  achieved  by 
considering  one  penalty  term  proportional  to  the  constraint  violation  and  other  to  its  integral. 
The  diflferentiation  of  (38a)  and  posterior  substimtion  into  (37)  leads  to  the  second  set  of 
canonical  equations 

p  =  Q  +  Iq  +  (38b) 


which  along  with  (38a)  constitute  a  set  of  2n  first  order  ordinary  differential  equations  in  the 
unknowns  p,  q  and  a*.  Again  only  n  equations  need  be  solved  at  each  function  evaluation. 

IVpically,  non-holonomic  constraint  conditions  for  multibody  systems  take  the  following 
form 


^  =  A(q,0  q  +  B(q,t) 

and  consequently  the  application  of  (38a-b)  leads  to 

(39) 

-b  A'^aA^  q  5=  p  -  A^a  (3  j  -  A'^ct* 

(40a) 

p  =  Q  +  Lq  +  A^a  ^ 

(40b) 

and 

aUi=cr:-h(*  +  /3 /'*dT)  ,  i  =  0,1,2,... 

\  Jto  /  »+l 

(41) 

with  (Tq  =0  for  the  first  iteration. 


6.  Numerical  Examples 
6.1.  A  SIMPLE  EXAMPLE 

To  better  understand  the  application  of  the  augmented  Lagrangian  formulation  in  singular  and 
non-singular  positions,  let  us  consider  the  slider-crank  mechanism  shown  in  Figure  1.  Both 
links  are  of  length  1  =  Im,  with  a  uniformly  distributed  mass  of  m  =  1  Kg.  We  take  as  position 
coordinates  q,  the  x  and  y  coordinates  of  the  crank  end,  and  the  x  coordinate  of  the  slider, 
thus  q”^  =  {xi,  yi,  X2).  We  consider  the  gravity  force,  with  a  value  5  =  9.81  m/s^  acting  in 
the  negative  Y  axis  direction. 

The  3x3  mass  matrix  corresponding  to  these  variables  is 


■4  0  r 

0  4  0 
I  0  2 
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Table  I.  Convergence  rate  with 
a  =  10* 


Iteration  # 

Error 

1 

6.5792  10“* 

2 

4J705  10"* 

3 

2.9206  1--'* 

4 

1.6107  10"'* 

This  mechanism  has  one  degree  of  freedom  only,  and  therefore  there  are  two  geometrical 
constraints  that  correspond  to  the  constant  distance  conditions 


«  = 


\  j  [(*2  -  +  yf  - 1]  j 


When  the  crank  fonns  an  angle  of  Ttfl  radians  with  the  horizontal,  the  coupler  is  coincident 
with  the  crank  and  the  crank  axis  is  also  coincident  with  the  slider  In  this  position  the 
mechanism  has  two  instantaneous  degrees  of  freedom,  since  it  can  undergo  either  the  motion 
of  a  slider-crank  or  the  motion  of  two  superimposed  rotating  bars.  Let  us  now  apply  the 
algorithm  ALFl  for  the  instantaneous  solution  of  the  accelerations,  for  both  a  nonsingular 
position  and  a  singular  position. 


Nonsingular  Position.  Consider  the  mechanism  in  an  initial  position  in  which  the  crank  forms 
an  angle  of  n/A  with  the  horizontal  and  in  which  the  slider  has  a  velocity  it  —  -2  m/s.  The 
exact  acceleration  has  been  computed  first  with  the  classical  Lagrange’s  multiplier  method  of 
equation  (4).  Then,  the  accelerations  have  been  calculated  with  the  algorithm  ALFl,  using 
equation  (27)  iteratively,  with  a  value  a  =  10^.  Table  I  shows  the  norm  of  the  difference 
between  the  exact  acceleration  and  the  one  obtained  with  ALFl. 

Table  I  also  shows  that  the  convergence  rate  of  the  iterative  algorithm  is  considerably  fast 
ihis  rate  agrees  with  that  predicted  analytically  in  (36J.  A  higher  penalty  value  gives  a  faster 
convergence  rate  but  a  lower  precision,  ^r  instance,  a  value  of  a  =  10^  yields  an  error  of  the 
order  of  10-12  in  one  iteration,  however,  further  iterations  are  unable  to  improve  the  solution, 
siiKe  some  precision  is  lost  in  floating  point  arithmetic  operations  between  numbers  with 
exponents  of  significantly  different  values. 


Singular  Position.  Now,  consider  the  crank  in  a  vertical  position,  forming  and  angle  of  n/l 
radians  with  the  horizontal.  As  vre  did  in  the  nonsingular  case,  we  take  again  a  slider  velocity 
valuexz  =  -2  m/s.  Since  the  mechanism  is  in  a  singular  position  with  2  instantaneous  degrees 
of  freedom,  we  also  have  to  specify  the  horizontal  velocity  of  the  crank  end.  It  can  be  easily 
shown  that,  theoretically,  the  crank  end  can  have  any  velocity  value  x\  —  v.  However,  the 
slider-crank  motion  must  satisfy  the  condition  xi  =  xz/l  over  all  its  motion,  and  therefore 
the  velocity  zi  =  -1  seems  the  obvious  choice.  Note  that  in  this  example  the  choice  for  the 
crank-end  velocity  is  being  made  explicitly,  but  during  a  dynamic  simulation  the  numerical 
integrator  will  provide  the  value  of  the  crank-end  velocity.  Since  the  integrator  assumes  a 
continuous  variation  of  the  variables,  this  condition  will  be  automatically  guaranteed. 
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In  this  case,  the  exact  acceleration  value  cannot  be  computed  with  equation  (4)  because  the 
leading  matrix  is  singular.  However,  the  application  of  equation  (27)  with  a  value  of  a  =  ICI* 
leads  to 


0  2 
Lg 


0 

(lO^  +  j 

0 


0 


0 

(2(10)^ +  9.81) 
0 


1 


which  can  be  inverted  and  leads  to  the  solution  (0,  -1.000473825436242729,  0).  After  3 
iterations,  the  result  is  (0,  - 1 ,  0),  accurate  to  14  digits. 

This  simple  example  clearly  and  simply  illustrates  that  the  penalty-augmented  Lagrang¬ 
ian  formulation  works  in  singular  positions,  when  the  classical  fonnulations,  such  as  the 
Lagrange’s  multipliers  method  or  the  reduction  to  independent  coordinates,  fail.  Also  note 
that  the  condition  number  of  the  leading  matrix  increases  at  the  same  rate  as  the  penalty 
parameter. 


6.2.  Dynamic  Smulation  of  toe  Slider-Crank  Mechanism 

Let  us  consider  again  the  same  slider-crank  mechanism  of  Section  6.1,  in  an  initial  position 
such  that  the  crank  forms  an  angle  ofir/4  radians  with  the  X  axis  and  that  the  slider’s  velocity 
is  X2  =  —4  m/s. 

We  perform  a  dynamic  simulation  by  integrating  the  equations  of  motion  for  a  total  of  10 
seconds,  using  a  conditionally  stable  variable  step  and  onler  integrator  based  on  predictor- 
corrector  multistep  formulae  [33].  We  set  the  error  tolerance  to  10~*  and  choose  as  penalty 
parameters  a  =  10^,  fl  =  10  and  /i  =  1.  During  the  simulation,  the  mechanism  goes  through 
the  singular  position  1 1  times,  following  a  periodical  response. 

First,  the  simulation  was  carried  out  with  the  acceleration  based  algorithm  ALFl.  Figure  3 
shows  the  X  acceleration  of  the  crank-end  over  the  time  period  of  10  seconds.  Figure  4  shows 
the  value  of  the  Lagrange  multiplier  Ai ,  corresponding  to  the  constant  distance  constraint 
condition  between  the  crank  axis  and  the  crank  end.  Finally,  Figure  5  shows  the  time  history 
of  the  total  energy,  which  should  be  kept  constant,  since  the  system  is  conservative.  A  very 
interesting  point  can  be  noted  in  Figures  3,  4  and  5.  The  value  of  the  acceleration  of  the 
crank-end  and  A]  present  spikes  around  t  =  9.25  s  and,  at  the  same  time,  the  eneigy  presents 
a  sudden  discontinuity.  The  cause  of  this  phenomenon  is  a  small  violation  of  the  constraints 
around  the  singular  position,  due  to  the  combination  of  the  errors  produced  by  the  numerical 
integration  routine  and  by  the  round-off  errors  produced  by  augmented  Lagrangian  procedure. 
These  errors  are  more  critical  in  the  acceleration  based  algorithm  ALFl  because  the  constraint 
equations  are  differentiated  twice. 

The  simulation  was  repeated,  this  time  using  the  algorithm  ALF2,  with  the  same  error 
tolerance  and  values  for  the  penalty  parameters.  This  time,  the  values  of  Ai  and  the  crank-end 
acceleration,  illustrated  in  Figures  6  and  7,  no  longer  show  the  spikes  resulting  from  ALFl. 
In  addition,  the  total  energy,  shown  in  Figure  8,  does  not  show  the  sudden  discontinuity  that 
results  in  Figure  5. 

The  accumulation  of  integration  errors  that  lead  to  small  constraint  violations  in  the 
neighborhood  of  the  singular  position  is  the  cause  for  the  sudden  peaks  and  jumps  in  the 
constraint  forces  and  accelerations  produced  by  ALFl .  These  can  be  removed  by  tighter  error 
tolerances  in  the  integrator.  The  better  results  obtained  under  the  same  conditions  with  ALF2 
are  due  to  its  better  constraint  stabilization  properties. 
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Fig.  3.  Acceleration  of  the  crank  end  with  ALFl. 


Fig.  4.  Lagrange’s  moltipUer  with  ALFl. 


Fig.  5.  Total  energy  with  ALFl. 


Energy  (j) 


1 8  E.  Bayo  and  A.  Avello 

1  2  3 


Fig.  10.  "nine  histoiy  of  xj  with  ALPl. 


6.3.  An  Assembly  of  Two  Four-Bar  Linkages 

Figure  9  shows  the  initial  position  of  a  one  degree-of-freedom  assembly  of  two  four-bar 
linkages.  This  mechanism  constitutes  a  particularly  critical  example,  because  when  it  reaches 
the  horizontal  positionthe  numberof  degreesof  freedom  increases  instantaneouslyfrom  1  to  3. 
To  define  the  position  of  the  system,  we  use  the  6  position  variables  (zi ,  yi,  X2,  yz,  xs,  j^). 
All  the  links  are  of  length  1  =  1  m  and  have  a  unifonnly  distributed  mass  m  =  1  Kg.  The 
gravity  force  acts  in  the  negative  Y  direction,  with  a  value  ^  =  9.8 1  m/s^.  At  t  =  0  the  initial 
velocity  is  ii  =  1.  We  integrate  the  motion  for  10  seconds,  using  the  same  integrator  and 
tolerance  as  before,  and  the  values  a  =  10^,  n  =  10  and  im=  1  for  the  penalty  parameters. 

The  analysis  was  carried,  out  twice,  first  with  the  algorithm  ALFl  and  then  with  the 
algorithm  ALF2.  The  results  obtained  with  ALFl  are  displayed  in  Figures  10  and  11,  which 
show  the  time  variation  of  the  coordinate  xi  and  the  Lagrange  multiplier  Ai ,  corresponding  to 
a  constant  distance  constraint  between  point  1  and  the  fixed  end  of  the  leftmost  link.  Figures 
12  and  13  show  the  variation  of  the  same  variables,  obtained  this  time  with  the  algorithm 
ALF2.  As  may  be  seen,  the  solution  writh  ALFl  becomes  unstable  after  3.3  seconds,  while 
ALF2  gives  congruent  results. 

The  reason  for  the  failure  of  ALFl  and  the  success  of  ALF2  are  found  again  in  the  better 
stability  properties  of  ALF2  with  respect  to  constraint  violations  (it  even  yields  a  successful! 
integration  when  just  the  generic  penalty  formulation  is  used  with  no  augmented  lagrangian 
iteration).  The  way  ALFl  may  be  improved,  if  it  is  to  be  used  in  repetitive  singular  positions, 
is  by  setting  tighter  error  tolerances  and  rising  the  value  of  the  parameter  Q.  However,  this  will 
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introduce  numerical  stiffness  in  the  problem  and  therefore  will  increase  the  computational 
effort.  In  this  example,  the  value  of  w  =  20  solves  the  problem  satisfactorily  at  the  cost  of  a 
lengjiier  integration. 

7.  Conclusions 

In  this  paper  we  have  concisely  reviewed  the  state  of  the  art  in  multibody  dynamic  simulation. 
We  have  also  revisited  the  acceleration  based  augmented  Lagrangian  formulation  in  the  context 
o*"  sinpilar  positions  (ALFl)  and  proposed  a  new  one  based  on  the  canonical  equations 
of  Hamilton  (ALF2)  for  both  holonomic  and  twn-holonomic  systems.  Both  formulations, 
ALFl  and  ALF2,  successfully  solve  the  simulation  problem  in  singular  positions,  however 
the  canonical  formulation  ALF2  proves  to  be  more  accurate  and  robust  than  ALFl  under 
repetitive  singular  configurations. 

The  advantages  of  the  proposed  method  can  be  summarized  as  follows: 

•  The  metitod  is  very  simple  to  implement  aitd  can  use  standard  off  the  shelf  conditionally 
stable  numerical  integrators  such  as  those  available  in  commercial  mathematical  libraries. 

•  The  fact  that  the  leading  matrix  of  the  equations  of  motion  is  always  positive  definite, 
symmetric  and  sparse,  allows  for  a  very  efficient  solution  of  the  equations  without  the 
use  of  pivoting.  This  applies  even  in  the  presence  of  redundant  (linearly  dependent) 
constraints  and  coordinates,  and  most  importantly  in  singular  positions. 

•  Both  the  generic  penalty  and  augmented  Lagrangian  methods  do  not  require  special  pro¬ 
visions  such  as,  detection,  elimination  of  constraints  or  regularization,  near  the  singular 
position.  The  integration  goes  through  the  singularity  in  a  procession  maimer  with  no 
need  for  additional  changes. 

•  The  Lagrange  multipliers  (reaction  forces  at  the  constraints)  are  obtained  without  having 
to  integrate  additional  equations. 

•  The  leading  matrix  is  strongly  banded,  feamre  that  in  principle  makes  it  an  order  n 
method,  where  n  is  the  number  of  depeiKient  coordinates.  Therefore  it  may  become  a 
very  efficient  formulation  for  those  systems  with  a  large  number  of  multibodies,  although 
this  assertion  needs  to  be  corroborated  by  further  research. 

•  The  acceleration  based  formulation  ALFl  shows  numerical  inestabilities  undr*’  repetitive 
singular  positions  that  arc  due  to  the  accumulation  of  round-off  and  constraint  errors. 
These  can  be  circumvented  with  tighter  tolerances  and  iiKieased  values  in  the  frequency 
of  the  dynamical  penalty  system  at  the  expense  of  additional  computational  cost 

•  The  canonically  based  method  ALF2  is  more  obust  and  has  not  shown  pathological 
behavior  in  any  of  our  simulations  (even  when  we  used  it  in  the  generic  penalty  way). 
There  authors  do  not  know  of  any  other  algorithm  that  can  simulate  the  motion  of  a 
multibody  undergoing  repetitive  singular  positions  as  ALF2  does. 

As  a  final  remark,  one  must  keep  in  mind  that  the  actual  behavior  of  multibody  systems 
around  singular  positions  is  physically  uncertain,  due  to  the  uncertainty  in  the  manufacturing 
tolerances.  It  is,  therefore,  unlikely  that  the  behavior  of  the  systems  simulated  with  the 
algorithms  presented  in  this  paper  may  be  experimentally  reproduced.  However,  the  usefulness 
of  the  algorithms  and  numerical  results  presented  herein  is  twofold.  Firstly,  they  provide  an 
efficient  and  reliable  tool  for  multibody  dynamics,  which  avoids  the  program  crashing  that 
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occurs  with  the  classical  formulations.  Secondly,  these  algorithms  become  useful  for  the  study 
of  the  different  alternative  motions  that  a  multibody  system  may  undergo  in  the  neighborhood 
of  a  singular  position,  when  one  or  several  geometrical  parameters  are  slightly  varied  to 
simulate  manufacturing  errors. 
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ABSTRACT 

This  paper  addresses  the  problem  of  inverse  dynamics  for  articulated 
flexible  structures  with  both  lumped  and  distributed  actuators.  This  prob¬ 
lem  arises,  for  example,  in  the  combined  vibration  minimization  and  tra¬ 
jectory  control  of  space  robots  and  structures.  For  such  flexible  structures, 
closed  loop  passive  joint  based  controllers  have  been  shown  to  be  effec¬ 
tive  in  trajectory  control  by  Paden  et  al.  Crucial  to  the  development  of 
such  closed  loop  controllers,  which  are  robust  to  external  perturbations,  is 
the  problem  of  dynamic  inversion  which  yields  the  nominal  state  trajec¬ 
tories  and  the  feedforward  inputs.  In  this  paper  we  propose  a  new  inverse 
dynamics  scheme  for  computing  the  nominal  lumped  and  distributed  feed¬ 
forward  inputs  for  tracking  a  prescribed  trajectory. 
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1.  Introduction 

Inverse  dynamics  is  an  important  problem  in  the  control  of  aniculated  flexible  struc- 
uires  such  as  space  stations  and  manipulators.  A  solution  for  the  nonredundant  lumped 
actuator  case  has  been  provided  by  Bayo  et.  ai.,  [1]  and  Book,  [2].  This  method  produces 
bounded  inputs  which  move  a  reference  point  on  the  structure  along  a  desired  trajectory. 
The  inputs  are  necessarily  non-causal  when  the  structure  dynamics  are  nonminimum 
phase.  Elastic  deformation  which  may  cause  vibration  of  the  structure  is  also  determined 
by  the  trajectory;  our  goal  is  to  minimize  such  vibrations.  The  viability  of  distributed 
actuators  for  the  control  of  structural  vibrations,  [3] ,  [4]  and  [5] ,  has  motivated  their  use 
here  for  trajectory  tracking. 

Trajectory  tracking  of  the  structure  can  be  accomplished  by  the  use  of  the  joint 
actuators  alone  [6]  and  in  this  sense  the  distributed  actuators  are  redundant.  However,  if 
only  the  joint  actuators  arc  used,  once  the  trajectory  to  be  tracked  is  prescribed,  the  feed¬ 
forward  inputs  and  consequently  the  induced  structural  vibrations  during  motion  are 
determined  uniquely.  Note  that  there  arc  many  state  trajectories  that  could  yield  the  same 
output  response,  but  we  can  only  access  a  particular  one,  with  a  given  set  of  non¬ 
redundant  number  of  actuators.  On  the  contrary  if  additional  actuation  is  available,  we 
have  the  freedom  of  choosing  a  more  favorable  state  dynamics.  In  this  paper,  we  develop 
the  concept  of  using  the  extra  actuation  available  through  the  distributed  actuators  in  the 
structure  to  not  only  satisfy  the  trajectory  tracking  constraint  but  also  minimize  the 
accompanying  elastic  displacements  during  the  motion.  To  obtain  these  new  feedforward 
inputs,  the  inverse  dynamics  method  suggested  in  [1]  is  extended  to  cover  cases  of 
redundantly-actuated  structures.  To  verify  the  efficay  of  the  proposed  method,  an  exam¬ 
ple  of  a  flexible  two  link  truss  structure  with  distributed  piezo-electric  actuators  was  stu¬ 
died  to  contrast  the  use  of  distributed  actuators  along  with  the  joint  actuators  for  end 
effector  trajectory  control  with  the  use  of  only  the  joint  actuators.  The  inverse  dynamics 
problem  studied  in  this  paper,  yields  the  nominal  desired  state  trajectories  and  the  feed- 
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fon^ard  input  which  can  be  used  to  develop  closed  loop  controllers  to  achieve  robust  tra¬ 
jectory  tracking!  6]. 

The  remainder  of  the  paper  is  organized  in  the  following  format.  The  modeling  of 
flexible  structures  with  joint  and  distributed  actuators,  the  formulation  of  the  problem  and 
its  solution  are  presented  in  Section  2,  Section  3  deals  with  an  application  of  the  pro¬ 
posed  method  to  the  example  of  a  two  link  flexible  truss.  The  discussions  and  conclu¬ 
sions  are  presented  in  section  4. 

2.  Formulation 

The  solution  to  the  general  multi-link  inverse  dynamics  problem  involves  studying 
an  individual  link  in  the  chain,  coupling  the  equations  of  the  individual  links,  and  then 
recursively  converging  to  the  desired  actuator  inputs  and  corresponding  displacements. 
This  approach  is  presented  below,  beginning  with  a  single  link, 

2.1  Equation  of  motion  of  a  single  link 

To  simplify  the  equations,  we  present  the  equations  for  a  link  with  a  revolute  joint. 
The  flexible  link  depicted  in  figure  1  fonns  part  of  a  multi-link  system.  The  link  is  shown 
with  a  revolute  joint,  however  the  formulation  remains  identical  for  a  link  with  transla¬ 
tional  joint  The  elastic  deflections  in  the  structure  are  defined  with  respect  to  a  nominal 
position  characterized  by  a  moving  frame  whose  origin  coincides  with  the  location  of  the 
hub  of  the  link.  The  nominal  motion  of  this  frame  is  prespecified  by  its  angular  velocity 
CO*,  angular  acceleration  a*  and  the  translational  motion  of  its  origin.  The  above 
definition  of  the  elastic  displacements  with  respect  to  this  nominal  frame  permits  the 
linearization  of  the  problem  from  the  outset  Incorporating  the  kinematic  model  followed 
by  Naganathan  and  Soni  [7]  in  a  finite  element  model  (FEM),  the  equations  of  motion  for 
a  single  link  at  any  time  t  can  be  written  as  [1] 

A/z -(-[c +Cc(coa)]  i z  =BtT  +BpVp +F.  (2.1) 
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Where  z  is  an  /?'*  vector  of  the  finite  element  degrees  of  freedom.  M  and  K  belong  to 
^nxn  aj.g  jjjg  conventional  finite  element  mass  and  stiffness  matrices  respectively; 
Cc  and  Kc  e  and  are  the  time  varying  Coriolis  and  centrifugal  stiffness  matrices, 
respectively.  The  matrix  C  represents  the  internal  viscous  damping  of  the  material. 
T  is  the  unknown  joint  actuation.  F  g  R'*  contains  the  reactions  at  the  end  of  the  link, 
and  the  known  forces  produced  by  the  rotating  frame  effect.  The  distributed  actuator 
input,  Vj,  (r)  e  Z?'^ ,  is  the  equivalent  nodal  forces  at  the  FEM  degrees  of  freedom,  where 
np  is  the  number  of  distributed  actuator  inputs.  Bj-  and  Bp  are  constant  matrices  of 
dimensions  Z?"  and  Z?"^ ,  respectively.  The  set  of  finite  element  equations  (2.1)  may  be 
partitioned  as  follows 
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where  0*  is  the  elastic  rotation  of  the  hub,  Zt  is  the  elastic  deflection  at  the  tip  in  the  y 
’irection,  and  the  other  n  -2  finite  element  degrees  of  freedom  arc  included  in  the  vector 
Zi .  The  force  vector,  F ,  and  the  Bp  and  Bj  matrices  are  also  partitioned  similarly. 


2.2  Minimization  Objective 

The  requirement  is  to  accurately  track  the  end  effector  of  the  link  along  the  given 
nominal  trajectory  without  overshoot  and  residual  vibrations.  Additionally  we  also  seek 
to  minimize  the  ensuing  structural  vibrations  during  this  motion  by  minimizing  J(T,Vp), 
a  measure  of  elastic  deflections  in  the  structure  defined  as  follows 


J{T,Vp)  4  j  z{tfz{t)dt. 


(2.3) 
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Mathematically  the  objective  can  be  stated  as 

(2.4) 

Where  f  is  the  set  of  all  pairs  of  stable  joint  torque  and  distributed  actuator  inputs  that 
when  used  to  actuate  the  system  defined  by  equation  (2.2)  yields  z,(r)  =  0  for  all  r. 

2.3  Solution  Methodology 

An  iterative  scheme  is  described  below  for  each  link.  Equation  (2.2)  can  be  rewrit¬ 
ten  as 


Mz-^Cz-^K2=BTT-^BpVp+F-CA(i>H)i  - /(Tc (a* .co* ) z  (2.5) 

where  the  time  dependent  Coriolis  and  centrifugal  terms  are  kept  on  the  RHS  of  the 
equation.  The  iteration  procedure  starts  with  the  absence  of  the  last  two  terms  involving 
Cc  and  Kc  in  the  right  hand  side.  Then,  the  system  of  equations  can  be  transformed  into 
independent  sets  of  simultaneous  complex  equations  by  means  of  the  Fourier  transform. 
For  each  of  the  evaluation  frequency  equation  (2.5)  becomes 


r 


M  -  -i-C  -  -T/ 

i  CO  co^ 


1 


•  ^ 

Zk 

■  1 

•  • 

- 

— 

f 

Fj, 

Fp. 

Q 

+ 

Fi 

+ 

J 

0 

Ft 

Fp, 

/■ 

-  - 

- 

(2.6) 


terms.  After  the  first  iteration  it  will  also  include  the  updated  contributions  from  the 
Coriolis  and  centrifugal  terms  appearing  in  the  RHS  of  equation  (2.5).  For  any  ©  0,  the 
matrix 


M  -  -^C  -  -^K 


1© 


(2.7) 


is  a  complex,  S3rmmetric  and  invertible  matrix.  For  ©  =  0  the  system  undergoes  a  rigid 
body  motion  and  H  ^  M  which  is  the  positive  definite  invertible  mass  matrix.  Let 
G  ^  H~K  Then  the  above  equation  can  be  re-written  as 


(2.8) 
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The  condition  that  the  tip  should  follow  the  nominal  motion  is  equivalent  to  =0  for  all 
CO.  This  induces  a  relationship  between  the  joint  actuation  and  the  distributed  actuator 
inputs  and  is  obtained  from  the  last  row  of  the  previous  equation. 


T  =-G,a-i 


G/a  0,i  G„ 


(F  +BpVf,). 


(2.9) 


Substituting  this  expression  for  the  input  hub  torque  in  equation  (2.8)  and  using  the  pro- 


petty  that  =  -arz  yields 


Where 


A  ^  [-G,a-i  GB7-(G,a  G„  G„)  +  G]  fl;, 
and 


(2.10) 


(2.11) 


B  ^  [-G,a-i  GBr(G,A  G^  G„)  +  G]  f  .  (2.12) 

Next  we  determine  Vp.  Using  Parscval’s  theorem,  minimizing  JiTMp)  in  equation 
(2.4)  is  equivalent  to  minimizing  919  i  at  each  G).  This  is  a  standard  least  squares 
approximation  problem  [8]  and  results  in  the  following  solution  for  the  distributed  actua¬ 
tor  inputs. 


V*B 


(2.13) 


where  £,  U  and  V  define  the  standard  singular  value  decomposition  of  A  as  follows 


V*AU  =  [5  ?]•  (2.14) 

Where  the  conjugate  transpose  matrix  operator  is  denoted  by  • .  In  addition  if  A  has  rank 
np ,  which  is  the  number  of  distributed  actuator  inputs,  then  the  least  squares  approxima¬ 
tion  yields 


Vp  =-(A*A)'iA*B  . 

A  sufficient  and  necessary  condition  for  A  to  have  rank  np  is  given  next. 


(2.15) 


Lemma 
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rank[A]  =  np  iff  rank  [B;- =  np+l  (2.16) 

Proof 

Rank  [B?-]  =  I  rank  [GBriGth  Gu  G,t)]  =  1 

=i>rank  [a  ^  GBrCG,*  G,,  Ga)  +  G]]  2  w-l. 

Since  Bj  =  [1  0  0]*,  it  is  easy  to  see  that  the  null  space  of  A  is  the  span  of  [1  0  0]*. 
Hence  rank  A  isn-1.  Noting  that  A  =  A  B^ ,  the  lemma  follows  easily.  □ 

The  above  lemma  requires  that  all  the  columns  of  the  input  matrices  and  Bp  be 
independent.  This  is  computationally  more  efficient  than  checking  the  rank  of  A  for  each 
(0.  Next,  the  corresponding  joint  torque  component,  T  is  then  evaluated  from  equation 
(2.9).  The  inverse  Fourier  transforms  for  the  feedforward  inputs  completes  the  first  itera¬ 
tion  and  results  in  torques,  and  distributed  inputs  Then  the  forward  dynamic 
analysis  is  carried  out  to  compute  Kg  and  G^.  F  in  the  RHS  of  equation  (2.5)  is  updated 
and  the  process  is  repeated  to  find  the  new  input  torques  and  voltages.  The  process  is 
stopped  at  the  n‘^  iteration  if  2  + "  Vp’*-Vp'^~H  2  <  6*  where  e  is  some  small 

positive  constant  It  may  be  noted  that  for  slow  motions  the  terms  involving  Kc  and  Cc 
are  small  relative  to  the  other  terras  in  equation  (2.1)  and  the  iterations  converge  in  a  few 
steps  [1]. 

2.4  The  Algorithms  for  the  Multi-Link  Cases 

In  the  previous  sub-section  the  procedure  to  evaluate  the  joint  actuations  of  a  single 
link  was  presented.  This  can  be  recursively  extended  for  multi-link  flexible  manipulators. 
Algorithms  are  presented  below  for  both  open  and  closed  chain  multi-link  mechanisms. 

Multi-Link  Open  Chain  Case 

1.  Define  the  nominal  motion  (Inverse  Kinematics  of  rigid  manipulator). 

2.  For  each  link  j ,  starting  from  the  last  one  in  the  chain: 

a)  Compute  torque  (or  force)  T i  and  distributed  actuator  inputs  Pi 
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imposing  2/  =  0  (Section  2) 

b)  Compute  the  link  reaction  forces  from  equilibrium, 

3.  Use  equation  (2.1)  to  compute  the  elastic  displacement  and  joint  angles. 

4.  Compute  the  inputs  for  the  next  link,  j-\. 

Multi-Link  Closed  Chain  Case 

1.  Define  the  nominal  motion  (Inverse  Kinematics  of  rigid  robot), 

2.  Define  an  independent  set  of  joint  forces  and  reactions  equal  in  number  to  the  degrees 
of  freedom  of  the  robot 

3.  For  each  link  j ,  starting  from  the  last  one  in  the  chain: 

a)  Compute  torque  (or  force)  T i  and  distributed  actuator 
inputs  Pi  imposing  2/  =  0  (Section  2) 

b)  Compute  the  link  reaction  forces  Rt  from  equilibrium. 

4.  Use  equation  (2.1)  to  compute  the  elastic  displacements  and  joint  angles 

5.  Use  elastic  deflections  to  correct  the  nominal  motion  of  each  link. 

6.  Repeat  steps  3  to  5  until  convergence  in  the  forces/torques  is  obtained 

This  concludes  the  methodology.  In  the  next  section  we  present  an  application  to  a 
two-link  flexible  manipulator. 

3.  Example 

A  twolink  truss  experiment  under  development  at  UCSB  is  shown  in  figure  2.  The 
trusses  are  made  of  lexan  and  have  lumped  masses  (net  2  Kg  for  each  link)  distributed 
along  their  lengths.  The  first  and  the  second  links  are  tip  loaded  with  3.5  and  1  Kg 
respectively.  Equivalent  beam  properties  of  the  trusses  used  in  the  FEM  model  for  simu¬ 
lations  are  Youngs  modulus  =  7  GPa,  Link  length  =  1.2  m,  density  =  1500  Kglm^, 
cross  sectional  area  =  4.378  e~^  and  cross  sectional  area  moment  of  inertia  =  4.7244 
e~^  m^.  Of  the  10  spans  in  each  link,  two  are  piezo-electrically  actuated.  They  are 
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located  at  the  second  and  ninth  spans  as  shown  in  the  figtire  2.  The  piezo-electric  stack 
actuators  in  those  spans  have  the  following  properties.  Cross  sectional  area,  Acs  =7.3 
m2,  piezo  strain  to  voltage  constant,  dsv  =  .731  e~^  Youngs  modulus,  Ep  =  73 
Gpa  and  distance  of  the  actuator  from  the  neutral  axis  of  the  truss,  d,  =  \.21  e~'^  m. 
Following  the  standard  Bernoulli- Euler  modeling  for  an  applied  voltage  the 

piezo-electric  actuation  can  be  considered  as  two  concentrated  moments  W  acting  at  the 
two  ends  of  the  actuator  [9]  and  [10].  Where  M  is  given  by 

M={ds,NpEpAcsds)V-^^  (3.1) 

and  Np~^  is  the  number  of  piezos  in  each  span.  For  the  truss  considered  above 
iVf  =  =  0.0l98Vi;^,tf .  The  desired  trajectory  is  a  rest  to  rest  motion  of  the  structure  with 
initial  conditions  given  by  0i  =  02  =  0  and  final  conditions  0i  =  1 1.25®  and  02  =  -22.5® . 
Where  0's  are  the  absolute  angles  of  the  links  with  respect  to  a  frame  fixed  on  the  ground 
and  are  shown  in  figure  2.  The  nominal  motion  of  the  tip  for  each  link  are  the  trajectories 
followed  by  the  tips  of  the  links  if  the  structure  were  rigid  and  followed  the  nominal 
angular  motions  shown  in  figure  3.  Using  the  procedure  in  section  2.4  for  open-chain 
mechanisms,  open  loop  simulations  were  performed  (1)  using  only  the  joint  actuation  for 
feedforward  and  (2)  using  the  distributed  piezo-electric  actuators  along  with  joint  actua¬ 
tors  in  feedforward  and  the  results  are  presented  below. 

Plots  of  the  input  piezo  voltages  and  joint  torques  are  presented  in  figures  4  and  5  respec¬ 
tively.  To  illustrate  the  viability  of  the  proposed  method  figures  6  and  7  show  the 
transverse  structural  midpoint  deflections  of  the  two  links  during  the  morion  with  and 
without  the  distributed  actuators.  Similar  plots  for  the  clastic  hub  rotations  are  shown  in 
figures  8  and  9. 

Thus  the  piezo-electric  actuators  show  a  significant  reduction  in  the  structural  vibrations 
and  demonstrates  the  viability  of  the  proposed  method.  The  consequent  reduction  in  the 
induced  strains  in  the  structure  allows  the  use  of  lighter  elements  and  smaller  actuators, 
especially  in  space  structures  where  the  loads  are  mainly  inertial. 
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4.  Conclusion 

Typically  distributed  actuators  like  the  piezo-electric  ones  cannot  gamer  enough 
actuation  to  cause  large  motions  in  the  structure.  However  they  could  be  very  effective  in 
controlling  the  small  structural  deformations  in  the  structure.  Their  use  in  the  feedfor¬ 
ward  to  aid  the  joint  actuators  for  trajectory  tracking  is  a  novel  idea  developed  in  this 
paper.  The  method  proposed  was  shown  to  be  extremely  efficient  in  removing  structural 
vibrations  from  structures  as  seen  in  the  example.  Thus  these  feedforward  actuations, 
obtained  through  the  proposed  inverse  dynamics,  augmented  with  joint  angle  feedback 
based  closed  loop  controllers  seem  promising  in  the  slewing  control  of  flexible  manipula¬ 
tors.  This  encouraging  result  motivates  further  work  on  distributed  actuators  in  the  con¬ 
trol  of  flexible  structures. 
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Piezoelectric  Actuator  Design  for  Vibration  Suppression: 

Placement  and  Sizing 
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In  this  paper  we  consider  the  problem  of  simultaneous  placement  and  sizing  of  distributed  piezoelectric 
actuators  to  achieve  the  control  objective  of  damping  vibrations  in  a  uniform  beam.  For  several  closed-loop 
performance  measures  we  obtain  optimal  placement  and  sizing  of  the  actuators  using  a  simple  numerical  search 
algorithm.  These  measures  are  applied  to  the  specific  e.xample  of  a  simply-supported  beam  with  piezoelectric 
actuators,  and  their  relative  effectiveness  is  discussed.  We  demonstrate  that  the  controllability  grammian  is  not 
suitable  to  determine  actuator  placement  for  vibration  suppression  problems. 


.Vomenciature 

Af,  =  cross-sectional  area  of  the  beam 

b  =  cross-sectional  width 

Cp  =  capacitance  of  the  piezoelectric  actuator 

c/31  =  piezoelectric  constant 

Ea  =  Young’s  modulus  of  the  adhesive  layer 

£(,  =  Young’s  modulus  of  the  beam  column 

Ep  =  Young’s  modulus  of  the  piezoactuator 

gji  =  strain  to  voltage  constant 

/(,  =  area  moment  of  inertia  of  the  beam 

/gq  =  equivalent  area  moment  of  inertia 

ip  =  thickness  of  the  adhesive  layer 

/(,  =  thickness  of  the  beam  column 

tp  =  thickness  of  the  piezoactuator 

v,  =  voltage  applied  to  the  top  piezo 

vj  =  voltage  applied  to  the  bottom  piezo 

y(x)  =  vertical  displacement  of  the  neutral  a.xis 

S'  =  derivative  of  the  delta  function 

P(,  =  density  of  the  beam 


I.  Introduction 

HE  control  of  large  flexible  structures  has  been  consid¬ 
ered  for  some  time.'  *  However,  the  recent  application  of 
piezoelectric  materials  by  Crawley’  and  Bailey  and  Hubbard^ 
for  actuation  of  flexible  structures  has  added  new  dimensions 
to  the  control  problem.  This  comes  from  the  fact  that  these 
actuators  can  be  distributed  along  structural  members  for 
vibration  and  shape  control.  In  this  paper  we  consider  the 
problem  of  simultaneous  placement  and  sizing  of  distributed 
piezoelectric  actuators  to  achieve  the  control  objective  of 
damping  vibrations  in  a  uniform  beam. 

Figure  1  shows  a  specific  example  of  a  simply-supported 
beam  with  piezoelectric  actuator  strips  attached  to  both  sides. 
Our  goal  is  to  find  the  position  and  length  of  piezoelectric 
actuators  to  maximize  modal  damping  when  feedback  control 
is  applied.  In  contrast  to  previous  approaches,  we  simulta¬ 
neously  optimize  the  position  and  length  of  the  actuator  strips. 
For  several  closed-loop  performance  measures  we  obtain  opti¬ 
mal  placement  and  sizing  of  the  actuators  using  a  simple  nu¬ 
merical  search  algorithm. 
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In  Crawley’s  early  work  the  actuator  was  simply  placed  with 
one  bending  mode  in  mind  at  the  location  of  maximum  strain 
for  that  mode.’  However  the  placement  problem  for  the  case 
with  two  or  more  controlled  modes  was  not  addressed.  Kon- 
doh  et  al.’  used  the  linear  quadratic-optimal  control  frame¬ 
work  to  perform  sensor  and  actuator  placement,  but  formu¬ 
lated  the  problem  such  that  the  solution  is  initial  condition 
dependent— this  dependence  is  removed  here.  Controllability 
was  used  as  a  performance  measure  for  placement  of  a  point 
actuator  in  Refs.  6  and  7.  These  methods  are  shown  to  yield 
less  effective  results  for  vibration  damping  in  beams  than  those 
based  on  closed-loop  performance  measures.  Previous  works 
on  actuator  placement  have  not  dealt  with  the  sizing  problem. 
We  incorporate  this  as  an  additional  optimization  parameter 
and  show  that  increasing  the  size  of  the  actuator  is  not  neces¬ 
sarily  better. 

Questions  of  robustness  to  spillover  and  actuator  dynamics 
have  been  raised  and  addressed  in  Refs.  1,  2,  and  8,  but  are  not 
addressed  here.  Another  important  issue  is  the  problems  in¬ 
volved  in  the  implementation  of  controllers  for  the  distributed 
piezoactuators.  These  include  depoling,  nonlinearity,  hyste¬ 
resis,  and  creep  effects  in  the  actuator.’  Depoling  may  be 
avoided  by  maintaining  the  applied  field  below  the  coercive 
field.  Within  the  depoling  limits,  the  nonlinearity  between  the 
applied  electric  field  and  the  resulting  actuation  strain  may 
require  the  use  of  more  complex  models.'”  An  alternative  is  the 
linearization  of  this  relationship  about  the  operating  point." 
Creep  and  strain  rate  dependence  of  the  actuator  become  im¬ 
portant  at  large  strains  and  low  frequencies.  Hysteresis  also 
plays  an  important  role  at  low  frequencies.  Significant  perfor¬ 
mance  improvement  over  such  behavior  is  possible  by  com¬ 
manding  the  induced  charge  rather  than  the  voltage  applied  to 
the  actuator.”  Other  related  issues  due  to  actuator  dynamics 
are  considered  in  Refs.  13  and  14,  but  not  addressed  in  this 
paper.  We  note,  however,  that  piezoactuators  have  fast  dy¬ 
namics  relative  to  electromagnetic  actuators  and  are  attractive 
in  this  regard. 

The  remainder  of  the  paper  is  organized  in  the  following 
format.  In  Sec.  II  a  state-space  model  of  a  piezoelectric  actu¬ 
ated  beam  of  finite  length  is  derived.  Section  Ill  formulates 
the  placeir.v.it  problem  with  respect  to  three  performance  cri- 


Flg.  I  Simply-supporied  beam. 
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teria.  These  are  applied  to  the  simply-supported  beam  in  Sec. 
IV  and  compared.  Our  conclusions  are  made  in  Sec.  V.  The 
Appendix  contains  detailed  calculations  of  the  mechanics  of  a 
piezoactuated  beam. 

II.  State-Space  Model  of  Actuated  Beam 

In  this  section  we  develop  a  state-space  model  for  the  trans¬ 
verse  vibrations  of  the  finite  beam.  The  boundary  conditions 
are  any  combination  of  pinned,  clamped,  or  free.  A  pair  of 
piezos  attached  to  the  top  side  and  bottom  side  of  the  beam  is 
used  to  actuate  the  beam  as  shown  in  Fig.  2.  A  second  pair  is 
used  as  sensors.  Collocation  is  achieved  since  the  two  pairs  are 
lo  'ated  side  by  side.  The  input  to  the  system  is  the  voltage  v 
applied  to  the  actuator  pair  and  the  output  is  the  strain- 
induced  voltage  generated  by  the  sensors.  We  show  in  the 
Appendix  that  the  net  forcing  of  the  beam  is  equivalent  to  two 
equal  and  opposite  moments,  M  and  -M  (Fig.  3),  applied  to 
the  beam  at  the  piezo  endpoints  Xj  and  X2.  Moreover,  the 
moment  M 

is  proportional  to  the  input  voltage 


It  is  clear  from  Eq.  (7)  that  the  /th  mode  is  controllable  if  and 
only  if  B,  is  nonzero.  In  this  paper  we  are  trying  to  control 
multiple  modes  with  a  single  piezo  and  so  there  must  be  a 
compromise  placement  and  sizing  such  that  B,  for  each  con¬ 
trolled  mode  is  nonzero. 

With  the  sensor  placed  as  shown  in  Fig.  2,  the  output  of  the 
sensor  piezos  is  a  linear  combination  of  the  modal  amplitudes. 
Substituting  Eq.  (3)  into  Eq.  (A9)  of  the  Appendix  yields 

K(0  =  ^5  S  ’),(0['I>,'(X2)-'I>;(X|)]  (8) 

I  s  1 

Define  C,  by 

c,  4  a:,  [♦; (Xj )--!>; (X,)]  (9) 

then 


HD  =  Y,  C,r,,(t)  (10) 

I  s  I 


M  =  Kv  (I) 

The  partial  differential  equation  describing  the  distributed  pa¬ 
rameter  system  is  therefore 

Ebhy“"  +PbAi>y  =  .V/[6'(x-X2)-5'(x-x,))  (2) 

We  assume  that  the  effects  of  the  actuator  on  the  mode  shapes 
are  negligible,  which  is  valid  if  the  dimensions  of  the  piezo  are 
small  compared  with  those  of  the  beam.  This  formulation  is 
simple  and  is  sufficient  for  the  work  presented  i.n  this  paper. 
More  detailed  models  are  available  in  Ref.  9.  Our  assumption 
of  a  finite  beam  with  pinned,  clamped,  or  free  boundary  con¬ 
ditions  guarantees  a  modal  decomposition  of  the  form 


y(x,t)  =  *Xx)v,U)  (3) 

:  -  I 

where  the  'l>,(x)  are  the  normalized  orthogonal  mode  shapes 
and  the  are  the  modal  amplitudes.  Substituting  Eqs.  (1) 
and  (3)  into  (2)  and  projecting  onto  the  ith  mode  yields  decou¬ 
pled  modal  equations 

p*A*r;,(0  +  =  ['1>;(x2)-*/(x,)]A’v(/)  (4) 


If  C,  is  nonzero  then  the  ith  mode  is  observable.  In  our  case  of 
collocated  sensors  and  actuators  (Fig.  2),  the  ith  mode  is  ob¬ 
servable  if  and  only  if  it  is  controllable. 

If  we  truncate  our  representation  to  n  modes,  the  dynamics 
can  be  written  as 


z(t)  =  Az(t)  +  Sv(/) 
K(0  =  C'z(i) 


where 


Z  =  [171  h: 


h-i  Ti  Vt 


Ak 


O,  l„ 

0,. 


Bk 


On  X  I 

s 


Ont\ 

fll 


(11) 


(12) 


where  determines  the  modal  stiffness  and  is  given  by 


.’t.» 

k  \  4>/  ■  ♦("’  dx 

J  0 


(5) 


Define 


Ck[C  0,x,]  =  [C,  Cj  -  C„  0,„]  (13) 

and  R  =  diag(u, w„). 

We  emphasize  the  fact  that  B  and  C  depend  on  the  piezo 
position  and  length  through  X|  and  X2. 


2  ^  Ehh'l': 

PhAi, 


and  S,  4 -i- [♦'(x2)-*'(x,)]a: 

PbAi,  '■  ‘ 


(6) 


Then  equation  (4)  can  be  written  as 


ri,{t)  -E  wfiJili)  =  Biii(t) 


(7) 


III.  Optimal  Placement  and  Sizing  of  Piezoactuators 

In  this  section  we  formulate  three  optimization  problems  for 
determining  a  good  placement  and  length  of  the  piezoactuator. 
In  words  they  are  the  following:  1)  subject  to  the  constraint 
that  collocated  damping  control  is  used,  find  the  placement 
and  length  that  maximizes  damping  uniformly  in  the  modes: 


3eam 


Fig.  2  Beam  segment  with  actuator. 


Fig.  J  Equivalent  beam  segment. 


DEVASIA  ET  AL.:  PlEZOELECmiC  ACTl,  ATOR  DESIGN 


S6I 


2)  assuming  the  system  is  detectable  and  stabilizable,  find 
the  placement  and  length  that  minimizes  a  standard  linear 
quadratic  cost  functional  uniformly  in  initial  conditions:  and 

3)  find  the  placement  and  length  that  maximizes  the  minimum 
eigenvalue  of  the  controllability  grammian  (as  is  done  only  for 
the  actuator  placement  problem  by  Arbel‘). 

Passive  Damping  Case 

For  the  first  optimization  problem,  we  assume  a  pure  collo¬ 
cated  damping  control  given  by 

v(l)  =  k^Cr,  (14) 

where  j)  =  [>7i  r;,]  and  is  the  controller  gain.  With 

this  control,  Eq.  (11)  can  be  written  in  vector  form  as 

ij  +  Q'rj  =  k^BCrj  (15) 

and  the  state-space  description  becomes 

z{t)  =  Az{t)  (16) 

where 


The  corresponding  control  is 

v(r)  =  -R-'B’’Pz{c)  (21) 

We  propose  minimizing  J,  for  the  worst  case  initial  condition 
and  therefore  pose  the  following  optimization  for  computing 
Xp  and  Lp . 

min  max  Zo^zih)  (22) 

Lpi[O.L^\  -.3  =1 

This  method  optimizes  performance  uniformly  in  initial  condi¬ 
tions  in  contrast  to  the  approach  by  Kondoh  et  al.'  where  a 
solution  sensitive  to  initial  conditions  is  proposed. 

Controllability  Grammian  .Method 

Finally,  we  describe  a  placement  procedure  based  on  the 
controllability  grammian  proposed  by  Ref.  6.  This  method  is 
useful,  but  has  certain  disadvantages  discussed  in  the  Conclu¬ 
sion.  With  the  inclusion  of  structural  damping  the  matrix  .4  in 
the  state-space  description  Eq.  (11)  becomes 


A  4 


Op  Ip 

-  n*  -  kpBC 


(17) 


The  advantage  of  such  a  collocated  passive  control  is  that 
controllers  designed  for  a  fmite-dimensional  model  retain  the 
stability  of  the  infinite  dimensional  plant  provided  that  actua¬ 
tor  dynamics  can  be  neglected. Since  the  bandwidth  of  a 
piezo  is  only  limited  by  its  capacitance,  actuator  dynamics  can 
be  justifiably  ignored  for  large  space  structures.  The  imple¬ 
mentation  issues  of  such  controllers  have  been  addressed  in 
Refs.  9  and  14. 

We  measure  the  system  performance  for  a  particular  choice 
of  controller,  placement,  and  piezo  length  by  the  rate  of  decay 
of  system  states  and  therefore  seek  to  place  the  poles  of  the 
system  far  Into  the  left  half  of  the  complex  plane.  More  for¬ 
mally,  we  perform  the  following  optimization 


min 

ip  i  i0.i,| 

XpilLp  l.Lp-Lpl] 


max  BeXi(A ) 


(18) 


where  \,(/l )  is  the  ith  eigenvalue  of  A.  We  vary  the  length  of 
the  piezo  Lp  from  zero  to  the  length  of  the  beam  The  piezo 
mu  t  not  overlap  the  ends  of  the  beam  hence  the  center  posi¬ 
tion  Xp  of  the  piezo  is  varied  from  Lp/2  to  T*  -Lp/2. 

Linear  Quadratic  Regulator  Case 

The  linear  quadratic  regulator  (LQR)  is  attractive  because 
the  controller  stabilizes  the  closed-loop  system  and  also  allows 
for  user  defined  weights  on  the  inputs  and  states.  LQR  opti¬ 
mization  has  been  used  to  reduce  the  structural  vibrations  in 
the  control  of  large  fle.xible  structures  in  Refs.  4  and  7  for  a 
fixed-size  actuator.  Here  we  include  placement  and  sizing  in 
the  optimization.  For  the  system  described  by  Eq.  (11),  con¬ 
sider  the  infinite-horizon  time-optimal  control  problem  of 
minimizing  a  quadratic  cost  functional  given  by 


where  f  is  the  structural  damping  coefficient.  Since  .4,  is  sta¬ 
ble,  as  the  final  time  T  tends  to  infinity,  it  can  be  shown  that 
the  finite  time  controllability  grammian  M'(0,n  approaches 
W,  which  is  the  solution  of  the  Lyapunov  equation 

iVAl ->■  A,iV  *  BB^  =  0  (24) 


Based  on  Arbel’s  method  we  perform  the  optimization 

max  min  X,(IT')  (25) 

LpU0.L>] 

XpdLpl.  Li-Lp'.] 

to  maximize  the  controllability  of  all  the  modes.  The  three 
approaches  discussed  here  are  compared  via  an  example  in  the 
next  section. 


IV.  Example:  Simply-Supported  Beam 

Consider  the  simply-supported  actuated  beam  (Fig.  1)  hav¬ 
ing  the  following  properties. 

Beam  properties 

f  =  0.01  E^=70CPa.  Lp=0.5m 

tp  =  0.01  m  b  -  0.05  m  Pt  =  2500  Kg/m^ 
Piezoactuator-sensor  properties 

Ep  =  62  GPa  Lpi[0,Lp]  fp=2xl0-‘m 

dj,  =  120xl0-'2m/V  g,,  =  10.6X  10-' Vm/N 

Cp  =  35  pC 

Adhesive  properties 

£a=2.4GPa  La  =  Lp  tj=2xl0-'m 


y,  ^  I  [f?v(r)^-f-z’'(f)Qz(r)]  dr  (19) 

.0  ' 

where  R  is  a  positive  scalar  and  Q  is  a  positive  semidefinite 
matrix  such  that  the  pair  {A,Q  ■)  is  observable.  Provided  sys¬ 
tem  (11)  satisfies  the  standard  conditions  of  stabilizability  and 
detectability,  the  minimum  cost  min  J,  is  given  by  z^to)Pz(to), 
where  P  is  the  unique  nonnegative-definite  solution  to  the  alge¬ 
braic  Riccati  equation'* 


Using  the  geometric  boundary  conditions 

v(0,f)  =y(L,r)  =  >"(0,1)  =y'(L,r)  =  0  (26) 

together  with  Eqs.  (2)  and  (3),  we  obtain  the  equation  for  the 
ith  mode  shape  normalized  so  that 

I  <l>f  djc  =  1 
.  0 


(2') 


PA  *  A^P  -  PBR  -  'B^P  ^Q=0 


(20) 


4>,(.ir)  =  \2/Lp  sin(irx/Lf,) 
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Substituting  this  into  Eqs.  (5)  and  (6),  we  have 


i-T- 


£t,h 


(28) 


The  /th  component  of  the  input  vector  B  is  obtained  from 
Eqs.  (6)  and  (27)  and  is  given  by 


ibt^K*x  2 

I  /fTq  Z,  I) 


(29) 


where  K*  and  Zgq  are  as  defined  in  the  Appendix  by  equations 
(A3)  and  (A4),  respectively.  The  tth  component  C,  of  the  out¬ 
put  vector  C  is  obtained  from  Eqs.  (9)  and  (27): 


2C,U  \Z.* 


(30) 


Note  that  a  necessary  condition  for  observability  and  con¬ 
trollability  of  the  tth  mode  is  for  the  term  in  parentheses  in 
Eq.  (30)  to  be  nonzero.  From  Eq.  (29)  we  see  that  B,  is  large  if 
x,  and  X:  are  chosen  to  lie  near  two  different  nodes  of  the  tth 
mode  separated  by  an  odd  number  of  half  cycles  of  the  mode 
shape  (hence  a  bigger  actuator  is  not  necessarily  better!).  Also 
the  t  th  (  /  >  1)  mode  becomes  uncontrollable  if  the  center  of  the 
piezo  coincides  with  one  of  its  nodes  or  the  piezo  length  is  an 
integral  multiple  of  2L*/i  .  For  example,  if  the  piezo  length  Lp 
is  the  same  as  the  beam  length  Z.(,,  then  all  of  the  even  modes 
are  uncontrollable. 


Simulations  were  performed  for  each  of  the  optimization 
schemes  discussed  in  Sec.  Ill  applied  to  the  earlier  truncated 
beam  model  (with  the  first  two  modes).  A  uniform  structural 
damping  coefficient  of  0.01  is  considered  for  both  of  the 
modes.  This  implies  that  the  system  is  stable  and  hence  both 
detectable  and  stabilizable.  For  the  passive  damping  case  a 
controller  gain  kj  =  \e  was  used.  In  the  LQR  based  design 
the  placement  depends  on  Q,  especially  if  some  states  are  not 
penalized.  However,  to  uniformly  penalize  all  modes,  Q  was 
taken  as  It  was  observed  in  the  simulations  that  the 
optimum  placement  and  sizing  did  not  vary  with  R.  The  results 
presented  are  for  the  Z?  =  1  case.  The  variations  of  the  objec¬ 
tive  function  for  the  passive  damping  case  with  1)  the  position 
of  the  center  of  the  piezo  Xp  (optimal  over  all  possible  actuator 
lengths)  and  2)  the  length  of  the  piezo  Lp  (optimal  over  possible 
piezo  placements)  are  shown  in  Fig.  4.  Similar  simulation  re¬ 
sults  for  the  LQR  and  the  controllability  grammian  methods 
are  shown  in  Figs.  5  and  6,  respectively.  Note  that  as  the  piezo 
length  Lp  approaches  the  beam  length  Z.*,  its  midpoint  is 
pushed  toward  Lp/1,  making  the  second  mode  less  control¬ 
lable.  Hence  the  cost  increases  as  Lp  tends  to  Z.*  and  leads  to 
an  optima)  actuator  length  less  than  Z.*.  The  optimal  lengths 
and  positions  for  the  three  methods  are  given  in  Table  1 . 

To  make  a  comparison  between  our  LQR  method  and  the 
initial  condition  dependent  methods  described  in  Refs.  3  and 
5  we  consider  the  following  two  different  initial  conditions 
ll  =[1  0  0  0]  and  zl  =[010  0),  which  correspond  to  unit 
displacements  in  the  first  and  the  second  modes,  respectively. 
The  variation  of  the  cost  of  control  over  different  actuator 


Fig.  4  Passive  dacnpiag  based  optimization. 


Fig.  5  Linear  quadratic  regulator  based  optimization. 


Fig.  6  Conirollabilty  grammian  based  optimization. 
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locations  for  each  of  the  earlier  initial  conditions  is  shown  in 
Fig.  7.  The  optimization  based  on  these  initial  conditions  re¬ 
sults  in  two  different  optimum  piezo  positions  at  L4/2  and 
1.9/4  suggesting  piezo  placement  at  locations  where  the  strain 
is  maximum  for  the  corresponding  initial  condition.  The  opti¬ 
mal  piezo  positions  for  various  initial  conditions  that  are  linear 
combinations  of  the  two  modes  will  vary  widely.  This  problem 
is  easily  resolved  by  the  LQR  methodology  proposed  in  this 
paper  as  it  optimizes  over  all  initial  conditions. 

Next  we  discuss  the  controllability  grammian  method  ap¬ 
plied  by  Arbel.‘  The  optimized  minimum  energy  control  cost 
for  given  initial  condition  z(to)  and  final  condition  z(T)  is 
equivalent  to 


Table  1  Results 

Controller 

Lp 

Optimal  — 

Lb 

Xp 

Optimal  — 

Passive 

0.7723 

0.3961 

LQR 

0.7921 

0.4060 

Contr.  Gramm. 

0.6139 

0.3267 

This  decrease  in  controllability  can  be  easily  illustrated 
through  a  scalar  example  described  by 


y  4  [zlD  -  ‘  (0,r)[2(r)  -  '“»z(fo)] 

(31) 

where  is  the  state  transition  matrix."*  J  is  the  energy 

required  to  deviate  from  the  natural  motion  of  the  system, 
which  would  have  reached  the  state  at  time  T 

without  the  application  of  any  control.  ArbeTs  method  is 
based  on  optimizing  J  (over  all  possible  initial  and  final  con¬ 
ditions)  and  results  in  minimizing  the  maximum  eigenvalue  of 
the  inverse  of  the  controllability  grammian,  fV''(Q,T),  over 
different  actuator  positions.  However,  the  control  cost  J  for 
the  finite  time  regulator  problem  Iz(D  =  01  is  given  by 

y,  4  [e^-<r-io)z(,„)]'tf'->(0,r)[e'^'i^-'o'z(to)]  (32) 

ard  is  not  suitably  evaluated  by  ArbeTs  controllability  gram¬ 
mian  approach  because  it  fails  to  consider  the  effect  of 
gA,{T-iii)  on  y,  in  Eq.  (32).  Next  we  show  that  the  maximum 
eigenvalue  of  H'"  'fO.D  increases  as  the  system  poles  move  far 
into  the  left  half  of  the  complex  plane.  Let  7,  be  an  eigenvalue 
of  iV  and  let  K  be  the  corresponding  eigenvector  such  that 
K,;!  2  =  1 .  Pre-  and  post-multiplying  the  Lyapunov  Eq.  (24)  by 
and  Vi,  respectively,  we  obtain 

y,^fVA]y,  +  y,^A,ivy^  -e  y,^BB^yi  =o  (33) 

Then 

yrBB^y  \max\,{BB^)\ 

y  _ _ ! - 1  <  - '■ _  (34) 

iyfA,yi  minlX,(/l,)| 

I 

This  result  implies  that  for  a  given  B  the  eigenvalues  of  W'*' 
increase  if  the  eigenvalues  of  A,  move  far  into  the  left  half 
plane.  Hence  ArbeTs  approach  deems  the  system  as  less  con¬ 
trollable  and  therefore  not  desirable  even  though  the  shifting 
of  the  poles  is  advantageous  for  vibration  reduction. 


x(t)  =  ~ax(t)  +  bu(t)  (35) 

Using  ArbeTs  method,  for  a  large  final  time  T,  the  eigenvalue 
of  If'fO.TO  tends  to  b‘/2a,  which  decreases  as  a  increases,  and 
subsequently  the  controllability  measure  deems  the  system  as 
less  controllable  even  though  from  a  vibration  reduction  per¬ 
spective  it  is  more  desirable.  A  similar  effect  due  to  the  high 
decay  rate  in  the  second  mode  is  seen  in  the  example  of  the 
simply-supported  beam.  The  two  modes  have  the  same  damp¬ 
ing  coefficient,  and  so  the  second  mode  has  a  faster  decay  rate. 
ArbeTs  placement  method  therefore  assigns  the  second  mode 
as  less  controllable  relative  to  the  first  mode  and  hence  pushes 
the  placement  toward  Lt,/A  where  the  second  mode  has  maxi¬ 
mum  strain.  Because  of  a  lower  decay  rate,  control  of  the  first 
mode  (which  is  more  controllable  when  the  placement  is  at  the 
center  of  the  beam)  is  more  critical  from  a  vibration  suppres¬ 
sion  perspective.  However,  the  placement  obtained  by  ArbeTs 
method  is  further  away  from  the  center  of  the  beam  and  hence 
results  in  slower  vibration  decays  in  the  system  as  compared  to 
the  results  obtained  through  LQR  or  passive  damping  methods 
(Table  I).  Thus  for  vibration  reduction,  passive  damping  or 
LQR  based  objective  functions  are  more  suitable  to  design  the 
placement  of  actuators. 

V.  Conclusion 

In  this  paper  we  formulated  actuator  placement  and  sizing 
methodologies  for  vibration  suppression  in  uniform  beams. 
Several  closed-loop  performance  criteria  were  considered  to 
derive  objective  functions  for  optimum  placement  and  sizing 
of  piezoelectric  actuators  in  uniform  beams.  The  papei  illus¬ 
trated  through  an  example  that  passive  damping  or  linear 
quadratic  regulator  based  measures  are  more  suitable  than  the 
controllability  measure  for  the  placement  and  sizing  of  actua¬ 
tors  to  obtain  vibration  reduction.  The  procedures  developed 
led  to  solutions  that  are  independent  of  initial  conditions.  The 
design  is  also  formulated  as  an  eigenvalue  problem  thereby 
reducing  the  required  computation  considerably. 

We  also  note  that  the  measures  proposed  in  this  paper  can 
be  applied  to  any  general  linear  time  invariant  system  and 
hence  the  actuator  design  for  vibration  suppression  can  be 
based  on  these  measures.  The  question  of  the  existence  of 
optimal  designs  over  different  controllers  (e.g.,  all  passive  con¬ 
trollers)  for  general  linear  time  invariant  systems  is  the  subject 
of  further  research. 

Appendix;  Mechanics  of  Piezo  Actuated  Beam 

Consider  a  segment  of  the  piezo  beam  as  shown  in  Fig.  2.  It 
is  assumed  for  simplicity  that  the  width  of  the  piezo  is  equal  to 
that  of  the  beam.  Using  the  standard  Bernoulli-Euler  beam 
approach,  the  moment  generated  by  voltage  applied  to  the 
piezos  is  given  by  Ref.  9 

■Vf*  =  h/p£4(A,-A;)(/i,/2-i-f, -‘•fp/2)]  (Al) 

where 


t, 


Fig.  7  Initial  conditions  based  optimization. 


I  =  1.2 


(A2) 
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Thus 


=  bE„d^dt>,<  2  +  i,  +  2)(v,  ~v.)^K*(v.-v.)/2  (A3) 

.V/*  is  the  effective  bending  moment  acting  on  a  beam  of 
equivalent  area  moment  of  inertia  given  by 


4.= 


"lA  ^2bjl- 

12  '‘'\2  2/  12 


(A4) 


where  b^  =  b{Ea/ tf,)  and  bp=b{Ep/E),)  are  the  equivalent 
adhesive  and  piezo  widths,  respectively.  Then  the  resultant 
strain  of  the  actuator  is 


i(x,h) 


\i*{x)h 


(A5) 


where  x  is  measured  along  the  length  of  the  piezo  and  h  is 
the  distance  from  the  neutral  axis.  The  previous  expression 
is  based  on  the  Bernoulli-Euler  model  described  in  Ref.  9 
and  a  detailed  explanation  of  the  results  based  on  this  model 
can  be  found  in  Ref.  17.  If  the  voltages  applied  at  the  top  and 
bottom  layers  of  the  piezoactuators  are  equal  in  magnitude 
and  opposite  in  sign  (vi  =  -  v,  4  v),  then  equation  (A3)  be¬ 
comes  M*  =  K*v .  Without  loss  of  generality  one  can  write  the 
effect  of  the  piezo  on  the  beam  as  two  equal  and  opposite 
concentrated  moments  whose  magnitudes  are  given  by  M  ~Kv 
as  shown  in  Fig.  3,  where 

(A6) 


The  sensor  output  is  obtained  as  follows.  The  incremental 
charge  d^  generated  on  an  infinitesimal  area  dx  is  given  by 


JQ 


=  ?u 


tj,  d'y(x) 
2 


b  dx 


(A7) 


This  may  be  Integrated  over  the  sensor  covered  length  of  the 
beam  to  yield  the  sensor  output  voltage 

K(0  =  [>''(x2,r)-y'(X|,r)|  (A8) 


=  [y '(X2,t)->''(x, ,/)]  (A9) 

wheregii  :•>  the  strain  to  voltage  constant,  Cp  is  the  capacitance 
and  Xi  and  Vi  are  the  locations  of  the  piezo  ends. 
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ABSTRACT 

In  this  paper  we  consider  the  problem  of  placing  piezoelectric  vibration 
dampers  in  an  experimental  articulated  flexible  structure  (AFS).  The  placement 
problem  for  these  jointed  structures  is  complicated  by  their  nonlinear  dynamics. 
Even  with  small  linear  vibrations  about  a  given  rigid  body  conflguration,  a  good 
placement  for  one  configuration  should  not  necessarily  work  well  after  the 
structure’s  joints  ate  rotated.  With  a  finite-element  model,  we  have  indeed 
observed  a  large  variation  in  modal  frequencies  (for  the  linearized  dynamics)  as 
the  joint  angles  are  varied.  However,  we  have  observed  that  for  our  structure  the 
mode  shapes  are  relatively  invariant  to  joint  configuration  and  hence  uniformly 
good  placements  are  possible.  We  also  show,  using  a  singular  perturbation  argu¬ 
ment,  that  such  uniformly  good  placements  can  be  expected  in  a  class  of  AFS 
with  joint  dominated  inertias.  Two  methods  of  placement  are  explored.  The  first 
is  based  on  a  simple  damping  controller  where  placements  are  chosen  to  maxim¬ 
ize  the  decay  rate  of  vibrations.  The  second  method  optimizes  placement  based 
on  closed-loop  perfonnance  of  Linear-Quadratic  Regulator.  The  two  methods  are 
contrasted  using  a  finite-element  model  of  our  structure. 

*  Reiearch  Atsitum,  Deptnineot  of  Mechanical  &  Environment^  Eaiineehng 
**  Associate  Professor,  Depaiunent  of  Mechanical  A  Environmental  Engineeiin| 
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1.  Introduction 

Control  of  Articulated  Flexible  Stnictures  (AFS)  is  important  in  many  space  and  aerospace 
applications.  The  control  objectives  are  application  dependent,  for  example  in  articulated  flexible 
manipulators  the  objective  may  be  to  track  a  given  end-effector  trajectory  or  to  reduce  the  elastic 
vibrations  in  the  structure.  The  problem  of  using  point  actuatore  has  been  addressed  in  [1]  and  [2) 
where  the  inverse  dynamics  problem  has  been  solved  to  obtain  the  control  inputs.  CTosed  loop 
controllers  which  achieve  exponentially  stable  trajectory  tracking  have  been  proposed  by  Paden 
et  al  [3].  However,  errors  in  the  modeling  of  the  structure,  approximations  in  the  implementation 
of  the  controllers  and  disturbances  from  external  sources  introduce  vibrations  in  the  structure. 
This  vibration  problem  is  compounded  by  backlash  and  friction  in  the  motor  drives  which  make 
small  amplitude  vibrations  difficult  to  control.  These  vibrations  adversely  affect  the  system  per¬ 
formance  and  are  to  be  minimized.  This  problem  can  be  mitigated  by  using  piezoelectric  or  elec- 
trostrictive  actuators.  Substantial  success  in  vibration  reduction  has  been  reported  [4]  by  incor¬ 
porating  such  distributed  actuators  in  the  flexible  stnictures.  To  optimize  the  vibration  reduction 
with  distributed  actuators,  synergistic  structure-controller  designs  are  necessary,  [5]  and  [6].  A 
key  issue  in  any  such  design  is  actuator  placement 

In  this  paper  the  placement  problem  for  distributed  actuators  in  articulated  flexible  struc¬ 
tures  is  addressed.  In  contrast  to  this  nonlinear  problem  the  placement  issue  for  linear  systems  is 
well  studied  in  literature.  In  Crawley’s  early  work  on  systems  with  linear  dynamics,  the  distri¬ 
buted  actuator  was  simply  placed  with  one  bending  mode  in  mind  at  the  location  of  maximum 
strain  for  that  mode  [7].  However  the  placement  problem  for  the  case  with  two  or  more  con¬ 
trolled  modes  was  not  addressed.  This  was  done  using  a  controllability  based  performance  meas¬ 
ure  by  Arbel  [8].  The  effectiveness  of  measures  based  on  closed  loop  controllers  ([9]  andflO]) 
over  those  based  on  the  controllability  grammian  for  vibration  damping  in  linear  flexible  struc¬ 
tures  was  demonstrated  by  Devasia  et  al[\\].  However,  the  computations  of  such  measures  are 
not  tractable  for  general  trajeaories  in  systems  like  AFS  which  have  nonlinear  dynamics. 

We  introduce  two  measures  in  this  paper  which  are  useful  to  determine  the  placement  of 


actuators  to  achieve  structural  vibration  reduction  in  AFS.  The  goal  of  vibration  damping  based 
actuator  placements  is  to  improve  the  convergence  rate  for  a  set  of  equilibrium  points.  For  our 
case  these  equilibria  are  rigid  body  configurations  with  zero  elastic  deflections.  Our  approach  can 
be  summarized  as  1)  linearizing  about  equilibrium  points,  2)  evaluation  of  a  local  cost  functional 
at  the  different  equilibrium  points  and  3)  selection  of  placement  based  on  a  suitably  constructed 
global  measure  (pertaining  to  the  entire  set  of  equilibrium  points).  The  measures  are  then  applied 
to  an  example  two-link  flexible  manipulator  to  solve  the  placement  problem  for  a  piezoelecuic 
actuator. 

The  remainder  of  the  paper  is  organi.’ed  in  the  following  format  Our  approach  is  described 
in  section  2.  In  section  3  we  illustrate  the  methodology  with  an  example  of  actuator  placement  in 
a  two-link  flexible  truss  structure  (figure  1).  For  this  articulated  structure,  modal  frequencies  of 
linearizations  ch;uige  considerably  with  configuration  and  one  may  expect  that  a  uniformly  good 
placement  is  impossible.  The  saving  fact  is  that,  although  the  frequencies  change,  the  mode 
shapes  are  relatively  invariant  This  results  in  placements  that  are  uniformly  "good”  over  all 
configurations.  Section  4  shidies  a  class  of  systems  (with  joim  dominated  inertias)  where  such 
uniformly  good  placements  can  be  expected  due  to  mode  shape  invariance.  Our  conclusions  are 
in  section  5. 

2.  Performance  Measures  for  Actuator  Placement 

In  this  section  we  formulate  the  problem  for  the  placement  of  distributed  actuators  in  a  gen¬ 
eral  articulated  flexible  structure  (AFS).  The  set  of  equilibria  in  AFS  consists  of  rigid  body 
configurations  with  zero  elastic  deformations.  Our  objective  is  to  damp  elastic  vibrations  about 
these  rigid  body  equilibritun  configurations.  In  the  following  section  solution  methodologies 
based  on  two  different  measures  are  described  in  detail.  We  start  with  a  description  of  the  general 
acmator  placement  problem. 
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General  Placement  Problem 

The  general  equation  of  motion  for  an  articulated  flexible  structure  can  be  described  by  a 
nonlinear  differential  equation  as, 

x{t)=fp(x,u{t)) 

y=hpix),  (2.1) 

where  x  is  a  vector  of  the  system  states  and  p  belongs  to  the  set  of  possible  actuator  placements 
Xp .  A  formal  dccign  prcceduie  To*  aciuator  placement  involves  the  following:  Given  a  set  of 
desired  state  trajeaories,  a  set  t/  of  possible  inputs  and  a  set  Xp  of  possible  placements,  evaluate 
the  formal  expression 

where  Xu  is  the  actual  trajectory  followed  by  the  system  when  actuated  by  input  u  and 
J uiti ,{Xu-Xd))  is  a  functional  that  defines  the  design  objectives.  The  issue  is  to  choose  an 
appropriate  functional  which  penalizes  undesirable  behavior  and  is  also  corapuutionally 
viable  —  an  example  is  a  cost  functional  which  is  quadratic  in  control  input  u  and  the  state  tra¬ 
jectory  error  (x„  - Xd).  However,  in  the  general  noifiiivear  setting  (2.1)  the  minimax  problem  is 
intractable. 

Here  we  address  the  simpler  issue  of  actuator  placement  to  suppress  elastic  deflections 
about  given  rigid  body  configurations.  Therefore,  our  control  objective  is  to  obtain  uniformly 
good  regulation  of  elastic  deformations  at  any  equilibrium  state  (rigid  body  configuration)  of  the 
articulated  structure.  This  implies  that  the  set  of  desired  trajeaories  Xd .  are  constant  trajectories. 
Our  approach  is  based  on  linearization  of  the  system  equations  at  each  equilibrium  point  in  the 
set  Xo  followed  by  a  local  cost  measure  evaluation  at  these  equilibrium  points.  In  the  following 
sub- section  we  describe  this  procedure. 

Linearized  State  Space  Model  and  Local  Measures 

Low  amplitude  elastic  deformations,  about  a  given  rigid  body  equilibrium  configuration  Xo 
and  for  a  given  placement  p ,  of  an  articulated  flexible  strucmre  can  be  approximated  bv  a  finite 
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number  of  equations  using  the  Finite  Element  Method  (FEM)  or  assumed  modes  method  as  given 
below  [12] 


(2.2a) 

y(0  =  Cp,x,z(t), 

(2.2b) 

where  z  e  R'^  represents  deformations  and  deformation  rates,  u  e  is  a  vector  of  the  / 
inputs,  and  yit)e  R^f  is  the  output  vector. 

Using  the  model  (2.2),  we  introduce  two  measures  to  determine  local  costs  for  different 
actuator  placement  In  words  they  are  the  following.  (1)  Subject  to  the  constraint  that  a  given 
"passive"  control  is  used,  find  the  placement  that  maximizes  damping  uniformly  in  the  elastic 
modes.  (2)  Assuming  all  states  of  equation  (2.2)  are  stabilizable  and  detectable,  find  the  place¬ 
ment  that  minimizes  a  standard  Linear  Quadratic  Regulator  (LQR)  cost  functional  uniformly  in 
initial  conditions  and  rigid  body  configurations. 

Passive  Control  Case 

We  use  the  term  "passive"  to  mean  input-output  passivity  as  defined  by  [13].  A  dynamical 
system  is  passive  if,  whenever  its  initial  states  xq  =  0  at  time  to,  its  input  m  (x)  6  /?  *  and  output 

t 

y  (X)  e  *  satisfy  Ju^  (x)  y  (x)  dx  >  0  for  all  time  t  >  to.  Local  collocated  controllers  like  the 

rate-based  damping  controUeri  for  flexible  structures  fall  under  this  category.  In  addition  to  the 
simplicity  of  these  passive  controllers,  they  exhibit  robusmess  to  spill-over  problems  due  to 
unmodeled  dynamics  [14].  For  this  optimization  problem,  let  the  feedback  control  input  be 
specified  by 

u(t)  =  -K^Cp,^z  .  (2.3) 

With  this  control,  equation  (2.2a)  can  be  written  in  vector  fonn  as 


z(r)  =  Ap,x,2(0 


(2.4) 
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where 

~  ,x,~^p ,x^  ■  (2.5) 

\Vc  measure  the  system  performance  for  a  particular  choice  of  controller  and  placement  by 
the  decay  rate  of  the  linearized  system  states  and  therefore  seek  to  place  the  poles  of  the  system 
(2.4)  far  into  the  left  half  of  the  complex  plane.  More  formally,  we  generate  the  following  local 
cost  measure  at  a  configuration  Xq  and  placement  p . 

c(p,Xo)  ^  mpt  ReXj(i4p,;(,)  (2.6) 

where  A.i(Ap.x,)isthei'^  eigenvalue  of 

Linear  Quadratic  Regulator  Case 

The  Linear  Quadratic  Regulator  (LQR)  is  attractive  because  the  controller  stabilizes  the 
closed  loop  system  and  also  allows  for  user  defined  weights  on  the  inputs  and  states.  LQR  optim¬ 
ization  has  been  used  to  reduce  the  structural  vibrations  in  the  control  of  large  flexible  stnicmres 
in  [9]  ,  [10]  and  [15].  For  the  system  described  by  equation  (2  2),  consider  the  infinite-horizon 
ume-optimal  control  problem  of  minimizing  a  quadratic  cost  functional  given  by 

Ju  ^  J[Ru{t)^  +  z^it)Qz{t)]dt  (2.7) 

where  R  is  a  positive  scalar.  Q  is  a  positive  semi-definite  matrix  such  that  the  pair  {Ap^x,'Q 
is  observable.  Provided  system  (2.2)  satisfies  the  standard  conditions  of  stabilizability  and  detec- 

N 

lability,  the  minimum  cost,  is  given  by  z^(fo)^^(^o)  where  P  is  the  unique  positive 

definite  solution  to  the  algebraic  Ricatti  equation  [16] 

PAp,::,  ^Al,,P  -PBp,^R-^Bj,^P  -^2  =0.  (2.8) 

The  corresponding  control  is 

u(r)  =  -R-»flp^.x.Fz(r).  (2.9) 

We  propose  minimizing  Jy^  for  the  worst  case  initial  condition  and  therefore  evaluate  the  local 


cost  function  as 


where  C(P )  is  the  maximum  eigenvalue  of  P  implicitly  dependent  on  the  placement  p .  This 
method  optimizes  the  local  performance  uniformly  in  initial  conditions  in  contrast  to  the 
approach  by  Kondoh  et  al  [9]  where  a  solution  sensitive  to  initial  conditions  is  proposed. 

We  state  the  placement  problem  in  the  next  sub-section. 

Problem  Statement 

Based  on  the  local  cost  measures  defined  in  the  previous  section  we  can  state  the  placement 
problem  as 

C(p)  (2.11) 

where  the  functional  C(p)  assigns  a  global  cost  for  each  placement  by  evaluating  a  suitable 
norm  on  the  function  c(p,.):Xo  -^R  ^  If  for  example,  the  set  of  equilibrium  points,  Xo .  is  con¬ 
tained  in  /?" ,  then  two  possible  choices  of  the  the  fimction  C  ( p )  are  the  average  and  worst  case: 

jc(Xo,p)dx 

(2.12) 

and 

cip^o)-  (2.13) 

The  measures  described  in  the  above  section  are  used  to  numerically  solve  the  problem  of  placing 
distributed  piezoelectric  actuators  in  a  two-link  flexible  manipulator-strictive  in  the  next  section. 

3.  Example 

The  effectiveness  of  distributed  actuators  has  been  demonstrated  for  vibration  reduction 
([17]  and  [18])  and  tracking  control  of  flexible  manipulators  [19].  Such  structures  exhibit  resi¬ 
dual  vibrations  due  to  imperfections  in  the  controller  and  disturbances  like  impacts,  thermal  heat¬ 
ing  and  micro-gravity.  For  example,  in  space  station  mobile  transporter  units,  power  and  weight 
restrictions  require  that  only  small  actuators  be  used  at  the  joints.  Consequently  the  large  numbei 
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of  gear  trains,  required  to  amplify  the  torques  provided  by  these  small  actuators,  exhibit  substan¬ 
tial  friction,  stiction  and  backlash.  Therefore  the  joint  actuators  are  ineffective  in  controlling  low 
amplitude  structural  vibrations.  The  placement  of  distributed  actuators  plays  a  critical  role  in  their 
ability  to  control  Lhese  vibrations. 

Experimental  Set-up 

The  example  considered  is  an  experimental  two-link  truss  structure  (figure  1)  built  at 
UCSB.  The  structure  has  16  spans  and  two  articulations  forming  a  planar  manipulator.  The 
trusses  are  made  of  aluminum  and  have  lumped  masses  (net  2  kg  for  each  link)  distributed  along 
their  lengths  in  order  to  lower  the  modal  frequencies  and  hence  the  control  sample  rate.  In  addi¬ 
tion.  the  first  and  the  second  links  have  tip  loads  of  3.5  and  1  kg  respectively,  and  their  lengths 
are  1.8  and  1.1m  respectively.  Actuation  consists  of  low-irtertia  dc-motors  at  the  two  joints  and 
an  active  bay  with  four  piezoelectric  actuators.  Sensing  consists  of  resolvers  at  the  joints  and  col¬ 
located  strain  sensors  on  the  four  piezoelectric  actuators.  The  bays  structural  properties  are  only 
minimaly  affected  by  the  addition  of  piezoelectric  actuators  as  the  the  Youngs  modulus  of  Lhe 
piezoelectric  actuator  (83  Gpa)  is  similar  to  that  of  the  aluminum  (77Gpa)  used  for  the  truss,  and 
the  added  mass  is  negligible  compared  to  the  lumped  masses  and  end  plates  (see  figure  2)  at  each 
end  of  the  bay.  However  these  effects  will  be  included  in  the  model  considered  for  consistency. 
The  entire  structure  is  supported  on  air  bearings  and  controlled  with  an  Intel  386-based  PC.  servo 
amplifiers  for  the  motors  and  150V  servo  amplifiers  for  the  piezoelectric  actuators. 

Modeling  Issues 

Our  interest  is  to  study  piezoelectric  actuator  placement  for  optimally  controlling  structural 
vibrations.  A  related  issue  is  the  implementation  of  controllers  for  the  distributed  actuators.  The 
problems  of  implementation  include  depoling,  nonlinearity,  hysteresis,  and  creep  effects  in  the 
actuator  [20].  Depoling  may  be  avoided  by  maintaining  the  applied  field  below  the  coercive 
field.  Within  the  depoling  limits,  the  nonlinearity  between  applied  electric  field  and  the  resulting 
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actuation  strain  may  require  the  use  of  more  complex  models  [21].  An  alternative  is  the  lineari¬ 
zation  of  this  relationship  about  the  operating  point  [22],  Creep  and  strain  rate  dependence  of  the 
actuator  become  important  at  large  strains  and  low  frequencies.  Hysteresis  also  plays  an  impor¬ 
tant  role  at  low  frequencies.  Significant  performance  improvement  over  such  behavior  is  possible 
by  commanding  the  induced  charge  rather  than  the  voltage  applied  to  the  actuator  (23).  Other 
related  issues  due  to  actuator  dynamics  are  considered  in  [4]  and  [24],  but  not  addressed  in  this 
paper.  We  note,  however,  that  piezoelectric  actuators  have  fast  dynamics  relative  to  electromag¬ 
netic  actuators  and  are  attractive  in  this  regard. 

In  this  section  we  study  the  placement  problem  to  optimally  control  structural  vibrations 
when  the  joint  motors  are  locked  because  of  friction.  In  the  flexible  manipulator  this  corresponds 
to  clamped  boundary  conditions  at  the  joints.  Our  objective  is  to  find  the  best  placement  of  the 
active  bay  in  the  truss  structure  which  minimizes  the  given  cost  functional  We  study  the  actuator 
placement  at  a  single  shoulder  joint  configuration  because  the  system  dynamics  are  independent 
of  the  shoulder  joint  angle.  Also,  the  elbow  joint  8  is  constrained  to  1 0 1  in  the  experiment. 
Then  by  symmetry  it  is  suthcient  to  study  tlie  prooiem  for  elbow  joint  angles  between  0^  and 
90^ .  Hence  the  set  of  equilibrium  states  X<,,  containing  rigid  body  configurations  with  zero  elas¬ 
tic  deflections,  is  parametrized  by  the  elbow  joint  angle  6  and  is  independent  of  the  placement  p . 

For  a  given  6  and  p  €  ,  the  linearized  system  equations  for  the  two-link  flexible  manipu¬ 

lator  generated  via  Finite  Element  Method  (FEM)  [1]  can  be  wrioen  as 

—^p,9^p  (3.1a) 

y=Cp,9X,  (3.1b) 

where  X  e  R’*  denotes  the  flexible  degrees  of  freedom,  ,  and  Xp  g,  are  the  mass  and 

stiffness  matrices  respectively.  Dp^s  includes  a  uniform  structural  damping  coefficient  (0.001)  in 

the  elastic  modes.  Hence  the  system  is  suble  and  thus  trivially  stabilizable  and  detectable. 

e  /?  Hs  the  input  voltage  to  the  actuator.  The  effect  of  the  piezoelectric  actuators  is  modeled 

as  two  point  moments  M  (figure  3)  acting  at  the  ends  of  the  actuator  ([20]  and[25])  and  is 

included  in  the  5^,9  matrix.  The  output  y  e  /?  *  is  obtained  from  a  strain  gauge  collocated  with 
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the  actuator  and  is  linear  in  the  state  [11].  From  equation  (3.1)  it  is  seen  that  at  each  p .  the  system 
can  be  considered  as  a  family  of  linear  systems  parametrized  by  0.  Noting  that  at  each  equili¬ 
brium  point  is  defined  by  Jt  =x  =  0„xi.  the  state  equation  (3.1)  can  be  rewritten  in  the  form  (2.2) 
as 


where 


z(t)=ApQ2(t)+Bp^QU{t) 
y(t)  =  Cp,QZ(t) . 


(3.2a) 

(3.2b) 

(3.2c) 


Ap.9  = 


Onxn  ^nxA 

-‘^p-'BfCp^Q  -Mp-^Dp^Q 


€  , 


(3.2d) 


and 


P-^ 


^p,9 


€^2«x1  ^ 


(3.2e) 


r 


. 


(3.2f) 


Controller  Choice 


The  two  measures  defined  in  section  2  axe  used  to  generate  solutions  to  the  placement  prob¬ 
lem  of  the  piezoelectric  actuaton.  The  passive  collocated  damping  controller  considered  can  be 
parametrized  by  a  scalar  feedback  gain  Kj  and  the  feedback  law  can  be  written  as  [  1 1  ] 


u  --Kti 


0\^ 


(33) 


While  computing  the  cost,  an  optimization  is  also  performed  over  the  scalar  controller  gain 
This  implies  that  the  local  meastire  c{p,Xo),  with  the  passive  damping  controller  is  given  by 

c(p,Xo)  ^  mpt  RcX,(Ap,x,)  •  (3.4) 

For  the  LQR  controller  based  measure,  weighting  matrix  Q  is  chosen  to  be  uniform  in  the  elastic 


- 11  - 


displacements  as 


Q  = 


xn  0/1  x/i 
^/ixrt  ^/ixn 


(3.5) 


The  choice  of  the  weight  R  on  the  control  input  depends  on  the  depoling  limit  of  the  actuator  and 
the  initial  conditions.  For  a  given  R ,  and  optimal  placement  Pop ,  the  maximum  input  voltage 
)  over  all  the  configurations  is  found.  This  variation  of  )  as  a  function  of  R  and  a 

the  bouixl  on  the  initial  conditions  i.e.  I  II  is  shown  in  figure  4.  Based  on  depoling  limits  for 
the  actuator,  the  maximum  applied  voltage  is  limited  to  300V  which  corresponds  to  an  applied 
field  of  2  MV Im .  For  a  =  le  -3  which  corresponds  to  a  maximum  transverse  elastic  deflections 
of  approximately  1cm,  R  -\e -6  falls  in  the  suitable  region,  with  maximum  input  voltages 
below  the  actuator  depoling  limit 


Simulation  Results 

For  a  given  elbow  joint  configuration  the  variation  of  die  passive  damping  and  LQR  based 
costs  with  placement  are  shown  in  figures  8  and  9  respectively.  The  optimal  placement  at  each 
configuration  is  marked  by  an  asterik.  Figure  8  shows  that  for  all  joint  configurations  the  passive 
damping  based  cost  is  minimal  when  the  placemem  is  at  the  root  of  the  first  link.  For  the  averag¬ 
ing  global  measure  defined  by  equation  (2.12).  the  LQR  based  measure  resulted  in  a  placement  at 
the  root  of  the  second  liitk.  To  illustrate  the  improvements  achieved  by  the  proper  placement  of 
the  piezoelectric  actuators,  simulations  of  the  equations  of  motions  for  the  passive  damping  con¬ 
troller  case  are  carried  out  Note  that  there  are  an  uncountably  infinite  set  of  possible  initial  condi¬ 
tions,  and  hence  as  many  possible  system  responses.  A  major  advantage  of  our  formulation  is  that 
the  design  is  optimized  over  an  entire  set  of  initial  perturbations  with  the  same  energy.  The  worst 
case  vibration  response,  for  the  passive  damping  case,  over  all  configurations  and  all  perturba¬ 
tions  of  a  fixed  energy  is  shown  in  figure  10,  which  shows  the  decay  of  the  systems  energy 
(kinetic  energy  +  potential  energy)  for  different  placements.  The  rate  of  vibrational  energy  decay 
is  maximum  when  the  actuators  are  placed  at  the  first  bay.  In  theory,  this  decay  rate  can  be  arbi- 


irarily  chosen  by  appropriate  feedback,  but  at  the  cost  of  larger  inputs  to  the  piezoelectric  actua¬ 
tors.  To  avoid  the  saturation  of  these  actuators,  the  input  is  penalized  in  the  LQR  controller  based 
design  measure.  The  average  variation  of  these  costs  over  the  different  placements  is  shown  in 
figure  1 1  and  the  resultant  placement  of  the  actuator  with  the  LQR  controller  is  at  the  the  root  of 
the  second  link. 

Discussion  of  Results 

.As  the  joint  angle  9  varies  from  0  to  90® .  there  is  a  substantial  variation  in  the  system 
modal  frequencies.  This  is  represented  in  figure  5,  where  the  variation  of  the  lowest  modal  fre¬ 
quency  with  elbow  joint  angle  is  shown.  These  significant  (40%  increase)  changes  in  the  modal 
frequencies  might  deem  uniformly  good  placements  impossible.  The  uniformity  of  placement 
location  is  a  desirable  result  because  it  implies  that  a  a  single  placement  is  effective  in  reducing 
vibrations  at  the  different  configurations.  The  placement  based  on  passive  damping  controller  is 
seen  to  be  invariant  with  joint  configuration,  when  the  LQR  based  measure  is  used,  the  resultant 
global  optimal  placement  (at  the  root  of  the  second  link)  is  not  the  best  for  certain  configurations. 
For  example,  when  the  structure  is  fuUy  extended,  i.e.  the  elbow  joint  angle  is  zero,  the  optimal 
placement  of  the  active  bay  (for  this  particular  configuration)  with  an  LQR  controller  is  at  the 
root  of  the  first  link.  However  the  increase  in  cost  due  to  placing  the  actuator  at  the  root  of  the 
second  link  is  negligible  (1%,  figure  9).  Hence  the  placement  is  not  very  sensitive  to  gross 
changes  of  system  configuration.  This  near  optimality  over  all  configurations  is  attributed  to  the 
relative  invariance  of  the  rotational  components  of  the  mode  shapes,  and  is  discussed  next 

The  placement  invariance  is  due  to  the  faci  that  in  our  example  the  eigenvectors  are  rela¬ 
tively  invariant  with  the  joint  configuration.  This  is  illustrated  in  figure  6,  where  the  projection  of 
the  eigenvectors,  associated  with  the  least  two  eigenvalues,  onto  the  rotary  elastic  displacements 
at  the  FEM  nodes  of  the  manipulator  model  is  showa  These  correspond  to  the  spatial  derivative 
of  the  projection  of  the  eigenvectors  onto  the  transverse  elastic  displacements  of  the  manipulator. 
Note  that  the  rotary  mode  shapes  shown  in  figure  6,  vary  little  with  joint  angle.  As  the  placement 
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varies  over  the  length  of  the  manipulator,  the  relative  control  over  each  mode  and  its  variation  are 
determined  by  these  mode  shapes  [24],  Consequently,  the  invariance  of  these  mode  shapes  with 
the  joint  angle  implies  the  relative  invariance  of  the  distribution  of  effective  control  over  -'ifferent 
modes.  It  is  this  control  distribution  of  the  different  modes  tliat  determines  the  cost  for  the  possi¬ 
ble  placement  and  therefore  its  invariance  results  in  a  uniformly  good  placement 

The  performance  measure  based  on  passive  damping  is  determined  by  the  distance  of  sys¬ 
tem  eigenvalues  from  the  imaginary  axis.  This  distance  is  an  estimate  of  the  slowest  decay  rate  of 
elastic  perturbations  in  the  structure.  In  our  example,  the  decay  rate  tends  to  increase  with  the 
elbow  joint-angle  as  shown  in  figure  7.  This  is  mainly  due  to  the  increase  in  the  system  natural 
frequencies,  but  with  constant  structural  damping.  For  such  a  system,  the  elastic  vibration  modes 
associated  with  higher  modal  frequencies  tend  to  have  higher  decay  rates.  Thus  in  the  case  of  the 
passive  controller  based  performance  measure,  it  is  the  variation  of  the  eigenvalue  associated 

with  the  lowest  frequency  that  detemiines  the  cost  and  therefore  the  placement  To  control  a  sin- 

% 

gle  eigenveaor,  the  optimal  placement  is  at  the  location  of  maximum  strain  [26]  where  the  effec¬ 
tive  control  over  that  mode  is  maximized.  Hence  the  passive  damping  based  measure  yields  the 
placement  at  the  first  span  of  the  structure.  As  discussed  above,  this  placement  is  uniformly  good 
for  all  joint  configurations  because  the  rotcuy  mode  shapes  are  relatively  invariant 

The  LQR  based  performance  measure  resulted  in  a  placement  where  the  control  objective  to 
minimize  the  quadratic  cost  functional  given  by  equation  (2.7)  is  achieved.  The  optimization  of 
this  quadratic  cost  functional  results  in  a  compromise  placement  between  high  decay  rates  of  per¬ 
turbations  and  small  control  effort  Note  that  the  placement  obtained  on  using  this  performance 
measure  is  relatively  good  over  all  configurations.  Thus  the  invariance  of  the  mode  shapes 
resulted  in  placements  which  were  near-optimal  for  each  of  the  possible  configiuations.  In  the 
next  section  we  study  a  class  of  AFS  where  such  urtifoimly  good  actuator  placements  can  be 
expected. 


4.  Mode  Shape  Invariance  in  Structures  with  Joint  Dominated  Inertias 
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In  this  section  we  show  that  "high  frequency"  mode  shapes  are  configuration  invariant  in 
AFS  with  joint  dominated  inertias.  We  also  show  that  this  invariance  results  in  actuator  place¬ 
ments  which  are  uniformly  good  over  different  configurations.  We  begin  with  the  following 
lemma  dealing  with  the  eigenvalues  and  eigenvectors  of  a  singularly  perturbed  eigenvalue  prob¬ 
lem  arising  in  our  analysis. 


Lemma 

Consider  the  problem  of  finding  the  eigen  pair  (X,  v ).  v  -  [v  j  v  2]^ ,  which  satisfies 


XAn+5ii  X4i2+^i2 

Vl 

X-421+521  X/l22(£b^^22‘*’‘®22 

V2 

where  Aii€/?''^./t22€/?'""^.the  otter  submatrices  A  21.  An,  etc.  have  compatible  dimen¬ 
sions,  and  det(/l  22)  ’*0- 

Then 

(1)  The  eigenvalues  of  the  singularly  perturbed  system  are  close  to  the  roots  of 
det  {lambdaA  ii  -i-fln)  =  0,  or  are  close  to  zero.  More  precisely,  given  r  >0,  there  exists 
eo(r )  >  0  such  that  0  <  e  ^  eo(r)  implies 

lX,Ce)-Pil  <r,  I  =  l,...,rt,  and  (5.2' 

lX,(e)l  <r,  i  =n-i-l,...,n+m  ,  (5.3) 

where  {pi,....pn)  arette  rootsofdet[(i4ii+5ii]  =  0.  and  (X,  (e),  v,  (e))  are  solutions  to 
(5.1)  (renumbered  if  necessary),  with  the  dependence  on  £  explicitly  stated. 

(2)  If  X,  is  an  eigenvalue  satisfying  (5.2)  is  not  too  small  (i.e.  P,-  0).  then  the  norm  of  v,  2  is 

smaU  relative  to  v,,i.  That  is,  for  aU  pi  0.  given  r  >0,  there  exists  ei(r)>0  such  that 
0  <  e  ^ei(r)  implies 
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>lvi.2(e)*2  <  '' llvi.i(e)il  2.  i=l . n  (5.4) 

where  (X, ,  V,' )  is  the  solution  to  (5. 1)  such  that  II  Vj  II  2  =  1. 


Proof 

(1)  This  follows  from  standard  arguments  in  singular  perturbations  theory, 
for  example,  see  [27]. 


(2)  if  Pi  *  0,  from  equation  (5.1), 


^XA22/e  +  ^2  +  522  '  Vi,2  =  -j^Xj4  21+^21]  V/.i 


(5.5) 


By  continuity  of  matrix  inversion  in  a  neighborhood  of  ^422.  given  5i  >  0  there  exists 
£»  .2(5i)  >  0  such  that,  whenever  0  <  e  ^Ei.2(5i), 


ll[|3iA22  +  ePi522  +  e522]"‘ll2  <  ei^^0^(l  +  5i).  (5.6) 

Choose  Ei  o  (from  first  part  of  the  theorem)  such  that  0  <  e  ^Ei,o  implies  that 
IX/ -Pi  I  <  l^.UtEi.3  -  min(Ei.o,Ei.2(l)). 


Then 


IVi,2l2  < 

whenever©  <  E  ^  E/, 3, where 


(5.7) 


K  A 


max 


Pi  ^21+^21+ 

If  /IT  =  0  then  choose  Ei  =  E3,  else  choose  Ei  =  .  njin 

Then  for  all  E  such  that  0  <  E  ^  Ei, 


82.  I 


Pi  i42i+B21”  I  Pi  I 

r  I  Pi 


(5.8) 


^i,  3  I 


unj^T^ 


II  Vi_2ll  1  <  r  llv/jll  2 


(5.9) 
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and  this  completes  the  proof.  Q 


Next  we  use  the  lemma  to  show  that  for  planar  AFS  (figure  12)  1)  consisting  of  interconnected 
beams  and  2)  having  joint  dominated  inertias,  the  modal  frequencies  can  be  partitioned  into  two 
groups.  One  "low  frequency"  set  corresponds  to  eigenvalues  that  tend  to  zero  as  the  joint  inertias 
become  large.  The  other  group  of  relatively  "higher  frequency"  modes  corresponds  to  the  eigen¬ 
values  of  the  AFS,  simply  supported  at  the  joints.  We  show  that  these  high  frequency  mode 
shapes  are  joint  configuration  invariant  Due  to  bandwidth  restrictions,  joint  actuators  on  the  AFS 
are  ineffective  in  controlling  these  high-frequency  vibrations,  but  the  distributed  actuators  are 
highly  effective  in  controlling  them.  Due  to  the  invariance  of  these  mode  shapes  in  AFS  with 
joint  dominated  inertias,  we  can  expect  to  find  distributed  actuator  placements  that  are  uniformly 
good  over  all  joint  configurations. 


To  show  the  mode  shape  invariance  we  generate  a  dynamic  model  for  the  beams  Using  the 
Finite  Element  Method  (FEM),  with  degrees  of  freedom  (dof)  as  shown  in  figure  13,  each  of  the 
individual  beam’s  mass  and  stiffiiess  matrices  can  be  assembled  to  obtain  the  total  mass  and  stiff¬ 
ness  matrices,  M,  and  /iff ,  of  the  structure.  The  system  equations  can  be  written  as 


M,  X  +K,X  F,  (5.10) 

where  F  is  the  external  force.  X  can  be  partitioned  into  X  i  and  X2,  where  the  later  consists  of 
the  translational  dof  at  the  joints.  Moreover,  the  matrices  Mi  and  K,  can  be  partitioned  such  that 
the  dynamic  equations  become 


where 


Mu 

Mi2 


1 

Kn 

K21 

Xx 

X2 

'T 

Kx2 

K22 

Xi 

=  BF 


(5.11) 


X2  = 


^(1) 

^(2) 

0(2) 


0(") 


(5.12) 
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represents  the  translational  degrees  of  freedom  at  the  joints.  Since  all  degrees  of  freedom,  except 
those  of  the  Joints,  are  defined  in  local  beam  co-ordinates,  only  the  joint  related  inertias 
{M\2,  iV/21,  M22)  and  stiffnesses  (^12.  ^^21.  ^22)  are  dependent  on  the  joint  angles,  0*.  How¬ 
ever,  and  ATue/?"^,  which  also  include  terms  corresponding  to  the  rotary 

degrees  of  freedom,  are  independent  of  9*.  The  submatrix  A/ 22  e  consists  of  two 

components—  the  contribution  from  the  joint  masses  M joints  ^  and  the  contribution  from  the  link 

M* 

inertias,  522-  Since  M joints  ‘s  large  by  assumption,  we  express  it  as  “Y".  where  e  is  small. 
Therefore  A/ 22  can  be  written  as 


A^22=  ^  '♦•^2  (5.13) 

Note  that  822  depends  on  0* ,  and  M  *  does  not  For  the  system  described  by  5. 1 1 ,  its  eigenvalues 
Xi  and  the  corresponding  eigenveaors,  v,- ,  satisfy 


lOi-P,!  =0  (5.15) 

lOj  I  =0  (5.16) 

where  pi  satisfies  det  A/n  -ATnj  =0-  Moreover,  in  the  limit  as  e-»0,  the  eigenveaors  vj , 

corresponding  to  tti .  are  those  of  the  structure,  simply  supported  at  the  joints.  Hence 


Vi.2  =  Q. 


(5.17) 
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Stnce  M  n  and  AT  n  are  independent  of  joint  configurations,  equation  5.18,  and  consequently  the 
eigenpairs  (a,-,  are  also  configuration  invariant  in  AFS  with  joint  dominated  inertias.  Hence 
the  associated  "high  frequency"  mode  shapes  and  in  particiUar  their  rotary  components  do  not 
change  much  with  joint  configuration  if  £  is  small.  This  leads  to  placements  that  are  good  over 
large  variations  in  joint  configurations  as  discussed  in  section  3.  The  above  arguments  can  also  be 
extended  for  spatial  AFS  with  complex  structural  connections  between  joints.  Thus  we  can  expect 
to  find  uniformly  good  distributed  actuator  placements  in  AFS  with  joint  dominated  inertias. 

5.  Conclusion 

In  this  paper  we  formulated  the  actuator  placement  for  Articulated  Flexible  Structures.  Two 
closed  loop  performance  criteria  were  considered  to  derive  objective  functions  for  optimum 
placement  of  actuators  in  such  systems.  The  procedures,  developed  independent  of  initial  condi- 
dons  and  fonnulated  as  eigen-value  problems,  are  easy  to  compute.  In  the  example  two-link 
manipulator,  these  measures  are  computationally  tractable  and  effective  to  solve  the  distributed 
actuator  placement  problem  for  optimal  structural  vibration  reductioa  An  important  observation 
is  that  the  mode  shapes  associated  with  nodal  rotations  for  the  example  two-link  flexible  maitipu- 
lator  are  relatively  invariant  with  joint  configuration.  This  invariance  resulted  in  a  placement 
which  is  effective  in  reducing  vibrations  over  all  the  different  configurations.  We  further  investi¬ 
gate  the  invariance  of  the  mode  shapes  and  identify  a  class  of  articulated  structures,  with  joint 
dominated  mass  distributions,  where  we  can  expect  such  uniformly  good  placements. 

The  measures  suggested  in  this  paper  can  also  be  used  for  actuator  placement  in  set  point  con¬ 
trollers  of  nonlinear  systems.  In  this  sense  the  approach  is  a  modest  attempt  at  the  larger  problem 
of  nonlinear  actuator  placement  The  existence  and  computability  of  optimal  designs  for  general 
trajectory  tracking  in  nonlinear  systems  is  the  subjea  of  further  work. 
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Figure  1 .  Experimental  Smart  Flexible  Structure 
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Figure  3.  Equivalent  FEM  representation 
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Figure  5.  Variation  of  First  Modal  Frequency  of  the  Structure 
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Figure  6a.  Variation  of  Rotary  Components  of  the  1st  Eigenvector 
with  Joint  Configuration 
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Figure  6b.  Variation  of  Rotary  Components  of  the  2nd  Eigenvector 
with  Joint  Configuration 
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Figure  7.  Variation  of  Eigenvalues  with  Joint  Configuration 
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Figure  9.  Variation  of  LQR  Based  Cost  Measure  over 
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Redundant  Actuators  to  Achieve  Minimal  Vibration  Trajectory 
Tracking  of  Flexible  Multibodies:  Theory  and  Application 

SANTOSH  DEVASIA  and  EDUARDO  BAYO 

Mechanical  &  Environmental  Engineering  Department,  University  of  California,  Santa  Barbara,  CA  93106 


Abstract.  We  address  the  problem  of  inverse  dynamics  for  flexible  multibodies,  which  arises,  in  trajectory 
irxking  control  of  flexible  multibodies  such  as  space  manipulators  and  articulated  flexible  structures.  Pre¬ 
vious  research  has  resolved  this  trajectory  tracking  problem  by  computing  the  system  inputs  for  feedfor¬ 
ward  control  of  actuators  at  the  joints.  Recently,  the  use  of  distributed  actuators  like  electro-strictive  actua¬ 
tors  in  flexible  structures  has  introduced  a  new  dimension  to  this  U'ajectory  tracking  problem.  In  this  paper 
we  optimaly  utilize  such  actuators  to  aid  joint  actuators  for  tracking  control,  and  introduce  a  new  inverse 
dynamics  scheme  for  simultaneously  (1)  tracking  a  prescribed  trajectory  and  (2)  minimizing  ensuing  elastic 
deflections.  We  apply  this  scheme  for  trajectory  tracking  of  a  two-link  two-joint  planar  manipulator  with 
joint  motors  and  distributed  electro-strictive  actuators.  Experimental  results  are  presented  to  contrast  our 
new  scheme  with  other  existing  methods. 


Key  words:  Flexible  articulated  structures,  multibody  dynamics,  actuator  redundancy,  trajectory  tracking, 
inverse  dynamics. 


1.  Introduction 

Inverse  dynamics  provides  an  excellent  means  for  trajectory  tracking  of  flexible  multibo¬ 
dies.  Methods  to  precompute  the  actuator  inputs  required  to  exactly  track  a  given  output 
trajectory  of  a  control  point  on  a  single  link  flexible  arm  were  provided  by  Bayo  [1]  and 
by  Kwon  and  Book,  [2].  The  solution  for  multi-link  open-chain  applications  has  been 
proposed  by  Bayo  et  al.  [3]  where  the  inverse  dynamics  and  kinematics  produce  bounded 
feedforward  inputs  for  actuators,  like  motors  at  articulations  and  joint  angles,  to  track  a 
reference  point  on  the  strucmre.  The  closed-chain  planar  case  has  been  recently  presented 
by  Ledesma  and  Bayo  [4],  If  the  sensors  and  actuators  are  non-collocated  then  the  flexi¬ 
ble  structure  has  nonminimum  phase  dynamics  and  the  only  stable  inverse  dynamics  solu¬ 
tion  to  the  tracking  problem  is  non-causal  [5],  Once  a  trajectory  is  specified,  the  feedfor¬ 
ward  control  input  obtained  by  inverse  dynamics  for  exact  trajectory  tracking,  has  a 
unique  bounded  solution.  Therefore,  the  subsequent  elastic  strucmral  vibrations  induced 
on  the  structure  (except  at  the  control  point  where  these  vibrations  are  zero)  during  the 
trajectory  tracking  motion  are  also  defined  uniquely.  These  vibrations  could  be  detrimen¬ 
tal  to  the  performance  of  sensitive  on-board  systems  and  hence  it  is  desirable  to  minimize 
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them.  For  some  rime,  distributed  actuators  have  been  strongly  considered  successfully 
used  to  to  control  structural  vibrations  ([6]  and  [7] ).  Recent  success  in  their  experimental 
use  [8]  motivates  the  use  of  such  actuators  to  aid  joint  actuators,  like  motors,  for  trajec¬ 
tory  tracking. 

The  trajectory  tracking  objective  can  be  accomplished  by  the  point  actuators  alone 
[9]  and  in  this  sense  the  distributed  actuators  are  redundant.  In  this  paper  we  introduce 
the  concept  of  using  the  extra  actuation  available  through  the  distributed  actuators  in  the 
structure  to  not  only  satisfy  the  trajectory  tracking  constraint,  but  also  minimize  the 
accompanying  elastic  displacements  during  the  motion.  A  new  inverse  dynamics  method 
is  presented  to  compute  the  feedforward  inputs  which  includes  the  cases  of  redundantly 
actuated  structures.  This  use  of  distributed  actuators  for  end  effector  trajectory  control  is 
contrasted  with  the  use  of  only  the  joint  actuators  in  feedforward.  The  method  proposed 
here  is  shown  to  substantially  reduce  the  induced  vibrations  in  the  structure.  The  results 
are  experimentally  verified  using  a  flexible  two  link  articulated  truss  structure  with  distri¬ 
buted  eiectro-strictive  actuators  and  joint  motors.  We  also  present  a  novel  strain  based 
control  scheme  for  eiectro-strictive  actuators  which  is  very  effective  in  reducing  hyster- 
isis  and  other  non-linear  effects  pre-dominant  in  such  actuators. 

The  remainder  of  the  paper  is  organized  in  the  following  format.  Modeling  of  flexi¬ 
ble  mulribodies  with  joint  and  distributed  actuators,  formulation  of  the  problem  and  the 
solution  methodology  are  presented  in  Section  2.  Section  3  deals  with  an  application  of 
the  proposed  method  to  a  two-link  flexible  truss  and  presents  experimental  results.  Our 
conclusions  are  made  in  Section  4. 

2.  Formulation 

The  inverse  dynamics  of  a  flexible  multi-body  is  a  non-linear  problem.  We  solve  this 
non-linear  problem  recursively,  one  element  of  the  multi-body  ai  a  time.  This  algorithm, 
proposed  by  Bayo  in  [3],  for  general  multi-body  inverse  dynamics  involves,  1)  studying 
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an  individual  component  (link)  in  the  chain;  2)  coupling  the  equations  of  the  individual 
links;  and  3)  recursively  converging  to  the  desired  actuator  inputs  and  corresponding  dis¬ 
placements.  Following  this  general  procedure,  we  propose  a  new  scheme  which  incor¬ 
porates  distributed  actuators  in  the  solution  of  the  inverse  dynamics  problem.  This 
approach  is  presented  in  the  following  section. 

2.1  Equation  of  motion  of  an  individual  link 

We  start  by  studying  the  dynamics  of  a  single  link.  To  simplify  the  equations,  we  con¬ 
sider  a  link  with  a  revolute  joint,  however  the  formulation  is  similar  for  a  general  element 
of  a  given  multibody  with  other  types  of  joints  like  the  translational  type.  The  flexible 
link  depicted  in  Figure  1  forms  part  of  a  multi-link  (multi-body)  system,  and  has  a  total 
length  L,  mass  per  unit  length  m,  area  moment  of  inertia  /,  cross-sectional  area  -4, 
Youngs  modulus  E ,  shear  modulus  G ,  and  shear  coefficient  k .  A  tip  mass  of  magnitude 
Mj  is  attached  at  one  end,  and  a  hub  with  rotary  inertia  In  at  the  other  end.  A  point  P  at  a 
distance  x  from  the  center  of  the  hub  has  undergone  elastic  deflections  of  magnitudes 
and  Uy ,  and  rotation  of  9 ,  defined  with  respect  to  a  nominal  position  characterized  by  the 
moving  frame  (ei,e2)  that  rotates  at  a  specified  angular  velocity  and  acceleration  (O;,  and 
a/,  respectively,  and  a  linear  acceleration  of  a*.  This  definition  of  motion  with  respect  to 
the  nominal  frame  permits  the  linearization  of  the  single  link’s  dynamic  equations.  Incor¬ 
porating  the  kinematic  model  followed  by  Naganathan  and  Soni  [10],  the  linear  and 
angular  accelerations  ( ap  and  a.p  respectively )  can  be  written  in  vectorial  notation  as 

ap  =  (Oft  x((Oh  xr)  +  akxr  +2(Of,  xvrei  +  +  Orel  ( 1  a) 

ap  =ah  +0  (lb) 

where  r  ^  (x  +  Ujc)fi(f)  ■hUye2(0  .  VrW  is  the  relative  velocity  of  point  P ,  whose  rela¬ 
tive  acceleration  is  a^gi.  The  components  of  acceleration  vectors  satisfy  the  following 
equations 


Ox  =-Oih^Ox  -ajtUy  -2(Of,Uy  +Ux  -(Okh  +afa 
Oy  =-af,Ux  -(Oh\  +2(Of,Uz  +Uy  -af,x  +ahy 


(2a) 

(2b) 
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ap=aH+Q  (2c) 

where  the  subscripts  x  and  y  denote  components  along  the  e\  and  ei  directions.  Now 

using  the  Timoshenko  beam  theory,  which  includes  the  effects  of  shear  deformation  and 
rotary  inertia,  the  principle  of  virtual  displacements  can  be  used  directly  to  generate  the 
following  equations  of  motion 

|[m  Qx  5ui  +m  Gy  6uy  +  m  T| 2 Op  59  ]dx  +Ih{o.h  +  0a )50a  +  + 

|[£/  Q'5Q'  +  GAk(Q  -u'y)5(Q  -u'y)  +  EAu'x  5u',  ]dx  = 

T  50 >,  +Rty  5u[y  +/?ct  5uu  +Tt  50,  (3) 

where  ri  is  the  radius  of  gyration  of  the  section.  The  subscripts  h  and  t  indicate  the  hub 
and  tip  respectively,  and  the  symbol  '  denotes  the  spatial  derivative.  6ux,  &Uy  and  50 
represent  a  set  of  vinual  elastic  displacements.  T  is  the  unknown  torque  to  be  applied  at 
the  hub  so  that  the  prescribed  tip  motion  is  obtained.  Note  that  the  hub  acceleration  is 
decomposed  into  the  nominal  acceleration  and  the  acceleration  due  to  elastic 
deflections.  Also  observe  that  the  reactions  at  the  hub  do  not  have  any  effect  in  the  total 
virtual  work.  As  shown  later,  this  constraint  is  met  by  imposing  the  constraint  that  the 
hub  moves  ilong  the  nominal  path  without  any  elastic  deformations.  The  displacement 
field  of  equation  (3)  can  be  discretized  using  the  finite  element  method  (FEM)  as  follows 

=  ^,(jc)Ux‘(f)  ;  Uy (x ,r )  =  (x ‘ (r )  ;  0(x,r)  =  ^,(x)0‘(r)  (4) 

where  //,  are  interpolation  functions  whose  order  depends  on  the  number  of  nodes,  N,  in 
the  elements  ;  u*  ‘  ,Uy‘  and  0  ‘  indicate  the  nodal  deflections.  It  should  be  noted  here  that 
other  alternative  approaches,  like  the  modal  superposition  method  may  be  followed  to 
generate  the  system  equations.  The  approach  chosen  is  dependent  on  the  ease  and  accu¬ 
racy  of  the  particular  method.  Substituting  equations  (2)  and  (4)  in  the  virtual  work 
expression  (equation  3),  and  following  standard  procedures  for  the  formulation  and 
assemblage  of  element  matrices  [11],  the  equation  of  motion  can  be  written  as  [3] 
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Mz  +  C+Cc(C0h)  2+  K  +  z  =  BjT  +  BpVp  F .  (5) 

WTiere  2  is  an  vector  of  the  finite  element  degrees  of  freedom.  M  and  K  belong  to 
and  are  the  conventional  finite  element  mass  and  stiffness  matrices  respectively; 
Cc  and  Kc  e  R''^  and  are  the  time  varying  Coriolis  and  centrifugal  stiffness  matrices, 
respectively.  The  matrix  C  represents  the  internal  viscous  damping  of  the  material. 
T  is  the  unknown  joint  actuation.  F  e  Z?"  contains  the  reactions  at  the  end  of  the  link. 


and  the  known  forces  produced  by  the  rotating  frame  effect.  The  distributed  actuator 
inputs  Vp  G  are  the  equivalent  nodal  forces  at  the  FEM  degrees  of  freedom,  where 
np  is  the  number  of  distributed  actuator  inputs.  and  Bp  are  constant  matrices  input 
influence  matrices  of  dimensions  Z?"  and  Z?'*’^,  respectively.  The  set  of  finite  element 
equations  (5)  is  partitioned  as  follows 


M  2,  +  C+Cc(03;,)  2,'  +  ^  (a/,  ,co>, )  2,  = 


0  7+  Bp^  Vp  +  F,  (6) 

.oj 

where  9;,  is  the  elastic  rotation  of  the  hub,  2,  is  the  elastic  deflection  at  the  tip  in  the  y 
direction,  and  the  other  n-2  finite  element  degrees  of  freedom  are  included  in  the  vector 
2, .  The  force  vector,  F,  and  the  Bp  and  Bj  matrices  are  also  partitioned  similarly. 


2.2  Minimization  Objective 

The  requirement  is  to  accurately  track  the  end  effector  of  the  link  along  the  given  nomi¬ 
nal  trajectory  without  overshoot  and  residual  vibrations.  If  the  distributed  actuators  were 
not  available,  then  the  exact  tip  trajectory  tracking  requirement  defines  the  joint  input 
torque  7.  Our  objective  is  is  to  use  the  additional  actuation  available  through  the  distri¬ 
buted  actuators  to  reduce  the  ensuing  structural  vibrations  at  locations  away  from  the 
control  point  during  this  motion  by  minimizing  7(7, ),  a  measure  of  elastic  deflections 
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in  the  structure  defined  as  follows 


Jijyp)  (7) 

— oo 

Mathematically  the  objective  can  be  stated  as 

min  .J{T,Vp).  (8) 

(T.v,)  €  r  ^ 

Where  T  is  the  set  of  all  pairs  of  stable  actuator  inputs  that  when  used  to  actuate  the 
system  defined  by  equation  (6)  yields  z,  (r )  =  0  for  all  t. 


2.3  Solution  Methodology 

An  iterative  scheme  is  described  below  for  each  link.  Equ»fion  (6)  can  be  rewritten  as 


Mz +Cz +Kz  =  BTT+BpVp +F -Cc((iih)i  -  ATe (a;, ,0)* )  z  (9) 

where  the  time  dependent  Coriolis  and  centrifugal  terms  are  kept  on  the  RHS  of  the 
equation.  A  study  of  the  influence  of  the  Coriolis  and  centrifugal  effects  on  the  inverse 
dynamics  has  been  presented  by  Gofron  and  Shabana  [12].  The  iteration  procedure  starts 
with  the  absence  of  the  last  two  terms  involving  Cc  and  Kc  in  the  right  hand  side.  Then, 
the  system  of  equations  can  be  transformed  into  independent  sets  of  simultaneous  com¬ 
plex  equations  by  means  of  the  Fourier  transform.  For  each  of  the  evaluation  frequency 
0),  equation  (9)  becomes 


A 

•  • 

-  - 

■ 

r 

Zh 

K 

A 

T 

h 

Bp. 

M  +  J-C 
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(0^ 
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+ 

Pi 

+ 

Bp. 

/\ 

Zt 

0 

A 

Bp. 

A 

A 

(10) 


where  the  symbol  stands  for  Fourier  transform,  and  F  represents  the  known  forcing 
terms.  After  the  first  iteration  it  will  also  include  the  updated  contributions  from  the 
Coriolis  and  centrifugal  terms  appearing  in  the  RHS  of  equation  (9).  For  any  co  ^  0,  the 
matrix 


M  + 


(11) 
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is  a  complex,  symmetric  and  invertible  matrix.  For  o)  =  0  the  system  undergoes  a  rigid 
body  motion  and  H  ^  M ,  the  positive  definite  invertible  mass  matrix.  Let  G  4  //-i. 
Then  the  above  equation  can  be  re-written  as 


(12) 


The  condition  that  the  tip  should  follow  the  nominal  motion,  is  equivalent  to  zi  =0  for  all 
CO .  This  induces  a  relationship  between  the  joint  actuation  and  the  distributed  acmator 
inputs  and  is  obtained  from  the  last  row  of  the  previous  equation. 
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(P  +  BpVp). 


(13) 


Substituting  this  expression  for  the  input  hub  torque  in  equation  (8)  and  using  the  pro¬ 
perty  that  z  =  -co^  z  yields 


Where 


and 


z  =  -X(AVp+B). 
co^ 


/I  ^[-G,h-^GBt(G,hGuGu)  +  G]Bp 


B  ^[-GtH-^GBT(.G,HG,iG„)  +  G]F. 


(14) 


(15) 


(16) 


Next  we  determine  Vp.  Using  Parseval’s  theorem,  minimizing  J(T,Vp)  in  equation  (8) 
is  equivalent  to  minimizing  Hz II  2^  at  each  co.  This  is  a  standard  least  squares  approxima¬ 
tion  problem  [13]  and  has  one  of  the  solutions  for  the  distributed  actuator  inputs  as. 


Vp=-U 


2-1  0 
0  0 


V*B  (17) 

where  2 ,  U  and  V  define  the  standard  singular  value  decomposition  of  A  as  follows 


V*AU  = 


2  0 
0  0 


(18) 
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and  the  conjugate  transpose  matrix  operator  is  denoted  by  * .  This  solution  is  unique  in 
the  sense  that  it  also  minimizes  II  Vp  II  2  over  all  possible  II  z  I  2^  minimizing  solutions.  In 
addition  if  A  has  rank  np ,  which  is  the  number  of  distributed  actuator  inputs,  then  the 
least  squares  approximation  yields 

Vp=-{A*A)-^A*B  .  (19) 

A  sufficient  and  necessary  condition  for  A  to  have  rank  np  is  given  next. 


Lemma 


Proof 


Rank  [Bj  ]  =  1 


rank  [A]=np  iff  rank  [Bj-  \  Bp]  =  np+l 


rank  [GBj(Gth  G,i  G„)]  =  1 


(20) 


■  rank 


A  -  [-G,f,  ^  GBjiGth  G,i  G„)  +  G]  >  n -I 


because  G  is  invenible  with  rank  n .  Since  =  [1  0  0]* ,  it  follows  that  [10  0]*  lies  in 
the  null  space  of  A  .  Hence  rank  A  is  n-1.  Noting  that  A  =  A  the  lemma  follows 
easily.  □ 


The  above  lemma  states  that  if  all  the  columns  of  the  input  matrices  Bj  and  Bp  are 
independent,  then  the  solution  for  Vp  is  given  by  equation  (19),  thus  the  computationally 
expensive  singular  value  decomposition  given  by  equation  (18)  can  be  avoided.  The 
independency  of  the  input  influence  matrix  columns  imply  that  the  different  modes  of  the 
structure  are  acted  upon  in  different  ratios  by  the  different  inputs.  If  the  actuations  of  two 
inputs  were  similar  on  all  the  modes,  then  they  can  be  lumped  together  to  be  considered 
as  a  single  input  in  the  computations.  This  distribution  of  actuation  effort  depends  on  the 
mode-shapes  of  the  structure  and  the  actuator  placement,  and  can  be  modeled  easily 
using  FEM  [14].  In  the  problem  at  hand,  the  best  placement  for  a  given  trajectory 
depends  not  only  on  the  structure  but  also  the  component  frequencies  of  the  desired 
motion.  Thus  the  optimal  placement  of  the  actuator  would  in  general  be  trajectory 
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dependent.  In  most  structures,  such  a  freedom  of  changing  the  actuator  placement  with 
the  prescribed  trajectory  is  not  available.  The  design  of  such  structures  with  fixed  actua¬ 
tor  placements,  is  based  on  minimizing  the  induced  structural  vibrations  over  sets  of  dis¬ 
turbances  with  a  specified  energy  [14].  In  systems  where  the  required  motions  are  largely 
repetitive,  the  actuator  placement  can  be  optimized  over  a  specified  set  of  trajectories; 
this  warrants  a  separate  treatment  to  be  considered  in  a  future  work. 

The  iteration  procedure  continues  as  follows.  The  corresponding  joint  torque  com¬ 
ponent,  f  is  evaluated  from  equation  (13).  An  inverse  Fourier  transform  evaluation  yields 
the  feedforward  inputs  and  completes  the  first  iteration.  The  results  of  this  first  iteration 
are  the  joint  torques,  and  the  distributed  inputs  Vj}.  Next  a  forward  dynamics  analysis 
is  carried  out  to  compute  and  Q .  F  in  the  RHS  of  equation  (9)  is  updated  and  the 
process  is  repeated  to  find  the  new  input  torques  and  voltages.  The  process  is  stopped  at 
the  n iteration  if  II T"  -7"-! H  2  +  •  "  ~^p ""*1  2  <  £  *  where  e  is  some  small  positive 

constant.  It  may  be  noted  that  for  slow  motions  the  terms  involving  Kg  and  Cg  are  small 
relative  to  the  other  terms  in  equation  (5)  and  the  iterations  converge  in  a  few  steps  [3]. 

2.4  Algorithm  for  the  Multi-Link  Case 

In  the  previous  sub-section  the  procedure  to  evaluate  the  joint  actuations  of  a  single  link 
was  presented.  This  is  recursively  extended  for  multi-link  flexible  manipulators.  An  algo¬ 
rithm  is  presented  below: 

1 .  Define  the  nominal  motion  (Inverse  Kinematics  of  rigid  manipulator). 

2.  For  each  link  j ,  starting  from  the  last  one  in  the  chain: 

a)  Compute  torque  (or  force)  TJ  and  distributed  actuator  inputs  Pyj 
imposing  z,j  =0  (Section  2) 

b)  Compute  the  link  reaction  forces  Rl  from  equilibrium. 

3.  Use  equation  (5)  to  compute  the  elastic  displacement  and  joint  angles. 
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4.  Compute  the  inputs  for  the  next  link,  j-\. 

This  concludes  the  methodology.  In  the  next  section  we  present  an  application  to  a 
two-link  flexible  manipulator. 

3.  Experimental  Verification 

An  experimental  truss  structure  developed  at  UCSB  is  shown  in  Figure  2.  The  structure 
has  16  spans  and  two  articulations  forming  a  planar  manipulator.  The  trusses  are  made  of 
aluminum  and  have  lumped  masses  (net  2  Kgs  for  each  link)  distributed  along  their 
lengths  in  order  to  lower  the  links  modal  frequencies  and  hence  the  control  sample  rate. 
In  addition,  the  first  (base)  and  the  second  links  have  tip  loads  of  3.5  and  1  Kgs  respec¬ 
tively.  These  loads  further  increase  the  flexibility  of  the  structure  and  the  natural  frequen¬ 
cies  of  the  first  and  second  links  with  clamped  free  boundary  conditions  are  0.6  Hz  and 
1.2  Hz  respectively.  Actuation  consists  of  low-inertia  dc-motors  at  the  two  joints  and  an 
active  bay  (Figure  3)  with  four  electro- strictive  actuators.  Sensing  consists  of  resolvers  at 
the  Joints  and  collocated  strain  sensors  on  the  four  electro-strictive  actuators.  In  addition, 
an  optical  sensor  measures  the  position  of  an  infra-red  LED  mounted  about  the  midpoint 
of  the  the  first  link,  thus  providing  information  of  the  induced  structural  vibrations  during 
the  tracking  operation.  The  entire  structure  is  supported  on  air  bearings  and  controlled 
with  an  Intel  386-based  PC,  servo  amplifiers  for  the  motors  and  150V  servo  amplifiers 
for  the  electro-strictive  actuators. 

A  major  concern  in  the  use  of  electro-strictive  actuators  is  their  non-linear  applied 
voltage  to  effective  strain  behavior.  Temperature  dependent  variations  of  this  relation¬ 
ship,  and  the  presence  of  hysterisis  prevents  their  effective  use  as  precision  actuators. 
Several  techniques  have  been  discussed  in  [15]  to  alleviate  this  problem.  These  include 
biasing  the  electro-strictor,  linearizing  the  model  for  small  inputs,  and  non-linear  models. 
We  propose  a  closed  loop  cascade  controller  (see  Figure  4)  with  a  feedback  control  based 
on  the  effective  strain  induced  by  these  actuators.  A  strain  gauge  collocated  with  the 
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electro-strictive  actuator  is  used  as  sensor  for  feedback.  In  our  experiments  a  simple  pro¬ 
portional  controller  yielded  excellent  results  as  seen  by  plot  of  effective  strain  versus  the 
command  strain  (Figure  5)  where  hysterisis  and  other  non-linear  behavior  of  the  electro- 
strictors  are  eliminated  when  a  feedback  is  used.  Such  a  cascade  control  scheme  is  partic¬ 
ularly  effective  here  because  the  actuator  dynamics  are  much  faster  than  the  dominant 
modes  of  the  multibody. 

To  evaluate  the  proposed  use  of  distributed  actuators  developed  in  Section  2,  we 
apply  it  to  track  the  end-effector  of  the  two  link  flexible  manipulator.  The  desired  trajec¬ 
tory  of  the  end-effector  is  a  series  of  rest  to  rest  motions,  while  the  first  link  is  stationaiy. 
The  nominal  motion  of  the  second  link  is  shown  in  Figure  6.  Our  objective  is  to  track  the 
desired  trajectory  and  minimize  the  vibrations  in  the  first  link  which  is  equipped  with 
electro-strictive  actuators.  To  evaluate  the  vibration  reduction  achieved,  we  conduct  the 
following  tracking  experiments:  (1)  feedforward  of  torques  computed  without  inverse 
dynamics,  i.e.  assuming  the  links  to  be  rigid;  (2)  using  the  torques  computed  by  inverse 
dynamics  for  only  the  joint  actuators;  and  (3)  incorporating  the  distributed  electro- 
strictive  actuators  on  the  first  truss  along  with  joint  actuators  in  the  inverse  dynamics 
computation  and  using  these  as  feedforward.  In  each  case  a  joint  based  PD  controller  was 
used  for  controlling  errors  due  to  unmodeled  dynamics,  friction  and  other  modeling 
errors.  The  stability  of  such  joint  based  controllers  are  discussed  in  [9].  The  results  of  our 
experiments  are  presented  below. 

Plots  of  the  inputs  to  the  electro-strictor  and  joint  motors  are  presented  in  Figures  7, 
8  and  9.  Note  that  the  actuations  start  before  the  tip  trajectory  begins.  This  non-causality 
due  to  the  propagation  delays  is  reduced  when  additional  actuation  is  available  through 
the  piezos  as  seen  in  Figure  9.  To  illustrate  the  viability  of  the  proposed  method  we  plot 
the  transverse  structural  deflections  at  the  midpoint  of  the  first  link  (Figure  10)  during  the 
motion  obtained  by  an  infra-red  led  mounted  on  the  structure  and  an  over- head  optical 
sensor.  These  elastic  deflections  in  the  structure  are  considerably  reduced  when  electro- 
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strictive  actuators  are  also  used  in  addition  to  the  joint  motors.  On  the  contrary  if  inverse 
dynamics  is  not  used  and  the  rigid  body  torques  are  used  then  the  resulting  motion  has 
much  larger  vibrations. 

Thus  the  incorporation  of  electro-strictive  actuators  results  in  a  significant  reduction 
in  the  structural  vibrations  and  demonstrates  the  viability  of  the  proposed  method.  The 
consequent  reduction  (50%)  in  the  induced  vibrations  of  the  structure  allows  the  use  of 
lighter  elements  and  therefore  smaller  joint  actuators,  especially  in  space  structures 
where  the  loads  are  mainly  inertial. 

4.  Conclusion 

Typically  distributed  actuators  like  the  electro-strictive  ones  cannot  gamer  enough  actua¬ 
tion  to  cause  large  motions  in  the  multibody  system.  However  they  could  be  very  effec¬ 
tive  in  reducing  structural  deformation.  To  reduce  such  vibrations  by  the  use  of  distri¬ 
buted  actuators  in  feedforward  aiding  joint  actuators  for  trajectory  tracking  is  a  novel 
idea  developed  in  this  paper.  The  method  proposed  is  extremely  efficient  as  it  optimaly 
reduces  structural  vibrations  and  the  theory  developed  was  verified  by  experiments.  The 
use  of  the  redundant  distributed  actuators  seems  promising  in  the  slewing  control  of  flexi¬ 
ble  manipulators  and  other  space  structures,  and  motivates  further  work  on  distributed 
actuators  for  the  control  of  flexible  multibodies.  In  particular,  for  systems  with  largely 
repetitive  trajectory  tracking  requirements,  future  work  will  address  the  actuator  design 
problems  Uke  placement  and  sizing,  from  a  vibration  minimization  perspective. 
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Abstract 

In  this  paper  we  address  the  problem  of  actuator  place¬ 
ment  for  articulated  flexible  structures.  Specifically,  the  place¬ 
ment  of  a  single  active  bay  on  an  articulated  truss  is  fimnmated 
and  solved  for  the  UCSB/Astro-Aerospace  truss.  We  observe 
that  controllability  grammian  methods  are  not  reliable  in  gen¬ 
eral  and  that  better  placement;  achieved  using  closed-loop 
performance  measures.  Some  preliminary  vibration  damping 
experiments  are  also  described. 

1.  Introduction 

The  control  of  large  flexible  structures  has  been  con¬ 
sidered  for  some  time  [1].  However,  the  recent  qrplication  of 
piezoelectric  materials  by  Crawley  [2]  for  actuation  of  flexible 
structures  has  added  a  new  dimension  to  this  problem  beemse 
these  actuators  can  be  distributed  along  structural  members  for 
vibration  and  shape  control.  This  pi^  deals  with  the  problem 
of  actuator  placement  to  achieve  optunal  danqnng  of  vibrarions 
in  articulat^  flexible  structures  (AFS)  throu^  the  example  of 
an  experimeiual  two-link  two-joint  truss  structure.  The  key 
issue  in  such  optimizations  is  the  use  of  a  suitable  design 
evaluation  measure.  Once  a  numeriesUy  tractable  measure  is 
available,  a  combined  structure-controUe,'  design  optitnization 
can  be  carried  out.  We  approach  the  problem  in  two  steps,  first, 
different  measures  to  perform  design  evaluation  in  linear  struc¬ 
tures  are  studied,  and  second,  we  extend  two  of  these  measures 
to  articulated  fiexible  structurea  (AFS)  with  non-linear  dynam¬ 
ics. 

For  structures  with  linear  dynamics,  our  goal  is  to  find 
the  placement  to  maximize  modri  damping  when  feedbadt 
control  is  applied.  Similar  problems  has  bm  studied  previ¬ 
ously  literature.  In  Crawley's  eariy  work  the  actuator  wu  sim¬ 
ply  placed  with  one  bendmg  mode  in  mind  at  the  location  of 
maximum  strain  for  that  mode  [2].  However  the  placement 
problem  for  the  case  with  two  or  more  controlled  modes  was 
not  addressed.  Kondoh  et  al  [3]  used  the  linear  quadratic- 
optimal  control  framework  to  perform  sensor  snd  actuator 
placement,  but  formulated  the  problem  such  that  the  solution  is 
initial  condition  dependent  --  this  dependence  is  removed  here. 
Controllability  was  used  as  a  performance  measure  far  place¬ 
ment  of  a  pomt  acmator  in  [4].  Our  main  result  for  die  linear 
case  states  that  methods  bas^  on  the  controllability  grammian 
can  yield  less  effective  placement  results  for  vibration  damping 
than  those  based  on  closed-loop  performance  measures. 

Next,  we  consider  die  non-linear  problem  of  placing 
piezo-electric  vibration  dampers  in  articulated  flexible  struc¬ 
tures  (AFS).  The  placement  problem  for  these  jointed  struc¬ 
tures  is  complicate  by  their  nonlinear  dynamics.  Even  widt 
small  linear  vibrations  about  a  given  rigid  t^y  oonfiguration,  a 
good  placement  for  one  configuration  should  not  necessar^ 
work  well  after  the  structure’s  joints  are  rotated.  With  a  finite- 
element  model,  we  have  inde^  observe  a  large  variation  in 
modal  frequencies  (for  the  linearized  dynamics)  as  the  joint 
angles  are  varied.  However,  we  have  discovered  that  for  some 
structures  (and  ours  in  particular)  nodal  rotations  associated 
with  the  mode  shapes  are  relatively  invariant  to  configurations 
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and  uniformly  good  placements  are  possible!  Two  methods  of 
placement  are  explored.  The  first  is  based  on  the  simple  rate 
based  damping  controller  where  placemaite  are  chosen  to  max 
imize  the  decay  rate  of  vibrations.  The  second  method  optim¬ 
izes  placement  based  on  closed-loop  performance  of  Linear- 
(^uaebratk  Regulator.  The  two  methods  are  contrasted  using  i 
finite-element  model  of  our  structure. 

The  remainder  of  the  pqier  is  organized  in  the  following 
format.  In  section  2  we  deroribe  the  experimental  setup  and 
modeling.  Section  3  formulates  measures  to  be  used  for  actua¬ 
tor  placment  in  Unear  structures  and  discusses  the  perfor¬ 
mance  measures.  Section  4  deals  with  the  extension  of  the 
measures  to  articulated  flexible  structures  along  with  an  exam¬ 
ple  and  preaents  simulation  aiad  preliminary  experimental 
results.  Our  conclusions  are  made  in  section  5. 

Experimental  Set-up 

The  effectiveness  of  distributed  actuators  has  been 
demonstrated  for  vibration  reduction  [1].  Such  smicniies  exhi¬ 
bit  residual  vibrations  due  to  imperfecuons  in  the  controller 
and  disnirbances  like  impacts,  thermal  heating  and  miao- 
graviiy.  For  example,  in  space  station  mobile  transporter  mani¬ 
pulators,  gear  trains  exhibit  substantial  friction,  suction  and 
backlash.  Under  such  conditions,  joint  actuators  are  ineffective 
in  controlling  low  amplitude  structural  vibrations.  The  proper 
placement  of  distributed  actuators  is  critical  to  the  control  of 
such  vibrations. 

The  example  we  consider  is  an  experimental  two-link 
truss  structure  (figure  1  and  shown  in  the  video)  developed  at 
UCSB.  The  structure  has  16  sparu  and  two  artic^ations  form¬ 
ing  a  phmar  manipulator.  The  trusses  are  made  of  riui^um 
and  have  lumped  masses  (net  2  Kgs  for  each  link)  distributed 
along  their  lengths  in  order  to  lower  the  modal  frequencies  and 
hence  the  control  sample  rate.  In  addition,  the  first  and  the 
second  links  have  tip  load*  of  3.5  and  1  Kgs  respectively,  and 
dieir  lengths  are  1.8  and  1.1  m  respectively.  Actuation  con¬ 
sists  of  low-inertia  dc-motors  at  the  two  joints  and  one  active 
bay  (Figure  2)  with  four  piezo-electric  actuators.  Sensing  con¬ 
sists  of  resolvers  at  the  joints  and  co-located  strain  senson  on 
the  four  piezo-electric  actuators.  The  entire  structure  is  sup- 
poned  on  air  bearings  and  controlled  with  an  Intel  386-based 
PC,  servo  amplifiers  for  the  moton  and  ISOV  servo  amplifiers 
for  the  Piezos. 

Our  interest  is  to  study  piezo  actuator  placenient  for 
optimally  controlling  structurid  vibrations.  A  related  issue  is 
the  implementation  of  controllers  for  the  distributed  actuaton. 
The  problems  of  implementation  include  de-poling,  nonlinear¬ 
ity.  hysteresis,  and  creep  effects  in  the  piezo  actuator  [5].  De- 
poling  may  be  avoided  ^  maintaining  the  applied  field  below 
the  coercive  field.  Within  the  de-poling  limits,  the  nonlinearity 
between  applied  electric  field  and  the  resulting  actuation  strain 
may  require  the  use  of  more  complex  models  [6].  An  alterna¬ 
tive  is  the  linearization  of  this  relationship  about  the  operating 
point  [7].  Creep  and  strain  rate  dependence  of  the  actuator 
become  important  at  lar^e  strains  and  low  frequencies.  Hys¬ 
teresis  also  plays  an  important  role  at  low  frequencies. 
Significant  pmormance  improvement  over  such  behavior  is 
possible  by  conunanding  the  induced  charge  rather  than  the 
voltage  mlied  to  the  actuator  [8].  Other  issues  related  to 
actuator  dynamics  are  coruidered  in  [9]  and  [10].  We  note  that 
die  Piezos  have  fast  dynamics  relative  to  electromagnetic 
actuators  and  are  attractive  in  this  regard. 
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System  Model 

The  modeling  of  actuator/stnicture  interaction  using 
Snite  dimensional  models  has  been  studied  in  detail  in  [S].  Fol¬ 
lowing  a  similar  approach,  low  amplitude  motions  relative  to  a 
given  rigid  body  ecpiilibrium  configuration  Xg  of  an  articulated 
flexible  structure  is  approximated  using  the  Finite  Element 
Method  (FEM)  [1 1|.  and  can  be  written  as 

Afp.ax  +  D^.ax  +  (2.1a) 

where  x  G  /?"  denotes  the  flexible  degree  of  freedom. 
Mp_9  ,  Dp,9  and,  Kp^s  ,  are  the  mass,  damping  and  sdShess 
matrices  respectively,  p  e  denotes  the  placement,  Qe 
is  the  joim  angle  vector  of  the  structure,  and  Up  G  ^  is  the 
input  voluge  to  the  actuator.  Note  that  joint  motor  torques  do 
not  appear  in  the  equations  as  the  joints  are  assumed  to  be 
locked  due  to  static  friction.  The  effect  of  the  piezo  actuators  is 
modeled  as  two  point  moments  M  (figure  3)  actintLat  the  ends 
of  the  piezo  ([S]  and[12])  and  is  included  in  the  Bp, a  matrix. 
The  output  y  s  R^  is  obtained  from  a  strain  gauge  <^located 
with  the  actuator  and  is  linear  in  the  state  [13] 

y=Cp.at.  (2.1b) 

From  equation  (2.1)  it  is  seen  that  for  each  placement  p>  the 
system  can  be  considered  as  a  family  of  linear  systems 
parametrized  by  9.  Noting  that  at  each  equilibrium  point  is 
defined  by  X  =  X  =  ihe  state  equation  (2.1)  can  be  rewrit¬ 
ten  as 

z(f)  =  y4p,92(t)  +  Bp,eu(f)  (2.2a) 

yU)  =  Cp,i2(l).  (2.2b) 


where 

A  —  i4p,9  +  — Bp,e^  0 lx*  Cp.ej  (3.4) 

The  advantage  of  such  a  co-located  passive  control  is 
that  controllers  deigned  for  a  finite-dimeiuional  model 
remains  stable  with  the  infinite  dimensional  plant  provided  chat 
actuator  dynamics  can  be  neglected  [14].  Since  the  bandwidth 
of  a  piezo  is  only  limited  by  its  capacitance,  actuator  dyrtamics 
can  be  justifiably  ignored  for  large  space  structures.  The  imple¬ 
mentation  issues  of  such  controllers  have  been  addressed  in  [S] 
and  [10]. 

We  measure  the  system  performance  for  a  particular 
choice  of  controller,  placement  and  piezo  length  by  the  rate  of 
decay  of  system  states  and  therefore  seek  to  place  the  poles  of 
the  system  far  into  the  lefr  half  of  the  complex  plane.  More  for¬ 
mally,  we  perform  the  following  optimization 

min  max  RcXi(A)  (3.5) 

P  * 

where  X,-  (A  )  is  the  i*  eigenvalue  of  A ,  and  p  is  one  of  the 
span  mit^ina. 

Linear  Quadratic  Regulator  Case 

The  Linear  (Quadratic  Regulator  (LQR)  is  attractive 
because  the  controller  stabilizes  the  clos^  loop  system  and 
also  allows  for  user  defined  weigha  on  the  inpua  and  sates. 
LQR  optimization  has  been  used  to  reduce  the  structural  vibra¬ 
tions  in  the  control  of  large  flexible  structures  in  [IS]  and  [16] 
but  the  formulations  were  mitial  condition  dependmt  -  this 
dependency  is  removed  here.  For  the  system  described  by 
equation  (2.2),  coiaider  the  infinite-horizon  time-optimal  con¬ 
trol  problem  of  minimizing  a  (quadratic  cost  functional  given  by 


’--It 

/y  ^  j[[Bv(t)2-t-zr(t)Gz(l)]di 

(3.6) 

x^  i^l  eR^  , 

(2.2c) 

0 

Oaxj»  fitXH  1  .  - 

Ap,o=  ,  .  .(2A1) 

-MpSDp,% 


where  B  is  a  positive  scalar  and  Q  is  positive  semi-definite 
matrix  such  that  the  pair  (A^,$,C  is  observable.  Provided 
system  (2.2)  satisfies  the  standard  conditions  of  stabilizability 
md  detectability,  the  minimum  cost  mjnyy  is  given  by 

z^(to)Pz  (to)  where  F  is  the  unique  noimegative-definite  solu¬ 
tion  to  the  algebraic  Ricatii  equation]  17] 


and 

Cp.B=[Cp,9  Oi^'\  gB‘*2-  .  (2.2f) 

3.  Actuator  Placement  for  Linear  Systems 

In  this  section  we  formulate  three  optimization  problems 
for  determining  a  good  placement  of  the  piezo  actuator  for  the 
linearized  system.  Our  objective  being  an  actuator  placement 
to  achieve  optimal  vibration  damping  in  the  elastic  modes.  In 
words  they  are  the  following.  (1)  Subject  to  the  constraint  that 
co-located  damping  control  is  used,  find  the  placement  that 
maximizes  damping  uniformly  in  the  modes.  (2)  Assuming  the 
system  is  detectable  and  stabilizable,  find  the  placement  that 
minimizes  a  standard  linear  quadratic  cost  fimctional  uniformly 
in  initial  conditions.  (3)  Find  the  placement  that  maximizes  the 
minimum  eigenvalue  of  the  controllability  grammian  (as  is 
done  for  the  actuator  placement  problem  by  A^l  [4]). 

Passive  Damping  Case 

For  the  first  optimization  problem,  we  assume  a  jxire 
co-located  damping  control  given  by 

v{l)  =  -kiCp,Bi  (3.1) 

where  kj  is  the  scalar  controller  gain.  With  this  control  the 
state-space  description  (2.2)  becomes 


PAp,B  +  Ap,B^P  -PBp,BR-'BpyP  +Q=Q.  (3.7) 

The  corresponding  control  is 

v(/)  =  -B-’Bp,9^Fz(/).  (3.8) 

We  propose  minimizing  /y  for  the  worst  case  initial  condition 
and  therefore  pose  the  following  optimizaticn  for  computing 
the  placement 

min  maxzJFzo.  *o  =  *(io).  (3.9) 

This  method  optimizes  performance  uniformly  in  initial  condi¬ 
tions  in  contrast  to  the  approach  by  Kondoh  er  al  [3]  where  a 
solution  sensitive  to  iititial  conditions  is  proposed. 

Controllability  Grammian  Method 

Finally,  we  describe  a  placement  procedure  based  on  the 
controUabiliy  grammian  proposed  by  [4],  This  method  is  use¬ 
ful.  but  has  certain  disadvantages  disused  in  the  following 
sectioit  Assuming  that  the  matrix  Ap,e  in  the  state  space 
description  (2.2)  is  stable,  as  the  final  time  T  tends  to  infinity, 
it  can  be  sluwn  that  the  finite  time  controllability  gramiruan 
li^(0,r)  approaches  W,  the  solution  of  the  Lyapunov  equation 

WAp,9^  +  Ap^aW  +  Bp^tBpy  =  0.  (3.10) 

Based  on  Arbel’s  method  we  perform  the  optimization 

maxminX,(W)  (3.11) 

p  * 

to  maximize  the  controllability  of  all  the  modes.  This  approach 
is  discussed  in  the  following  sub-section. 


z(t)  =  A  z(0 


(3.2) 
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Discussion  of  the  Controllability  Grammian  based  Measure 

We  discuss  in  this  sub-section  the  drawbacks  of  the  oon- 
troUability  pammian  method  qrplied  by  Aibel  [4],  Hie  cost 
for  the  minimum  ene^  control  given  initial  cc^tion  z({o) 
and  final  condition  z  (jT)  is 

J  k  [z(T)-e'^*^"'‘’z«o)]  H'-'(0,r)[z(r)-e'^*‘^''*>z(/o)] 

(3.12) 

where  ~  is  the  state  transition  matrix.  /  is  die  energy 
required  to  deviate  firom  the  natural  m^oii  of  the  system, 
which  would  have  reached  the  state  “ **'z  (to)  at  time  T 
without  the  q^licatkm  of  any  control.  Arbel’s  method  is 
based  on  optimizing  J  over  all  possible  initial  end  final  condi¬ 
tions  and  results  in  minimizing  the  maximum  eieenvalue  of  the 
inverse  of  the  controllability  grammian,  over  dif¬ 

ferent  actuator  positions.  In  contrast  we  are  interested  in  a  par¬ 
ticular  final  coiidition  namely  zero.  The  control  cost  J,  for  tiiis 
finite  time  regulator  problem  (z  (T)  =  0)  is  gjven  by 

Jr  k  [c^'^^-'^zCfo)]  V-'(0J)  (to)]  (3.13) 

which  differs  from  J  simificantly.  Next  we  show  that  the  max¬ 
imum  eigenvalue  of  Vr '‘(0,7)  increases  as  the  system  poles 
move  far  into  the  left  half  of  the  complex  plane.  Thus  J 
increases  as  the  system  becomes  more  dampedi  Let  y  be  an 
eigenvalue  of  W  and  let  v  be  the  corresponding  eigenvector 
such  that  I  vl  2  =  1.  Pre-  and  post-multiplying  the  Lyapunov 
equation  (3.10)  by  and  v  reqtectively,  we  obtain 

v^»V'i4;,.e^v  -t-  v^Ap,«Wv  +  =0(3.14) 


Then 


2vMp.oV 


^  niin'nLj(;c;.oyi — 


The  above  result  implies  that  for  a  given  fi^,o  the  eigenvalues 
of  W~^  increase  if  the  eigenvalues  of  Ap,^  move  into  the  left 
half  plane.  Hence  Arbel's  approach  deems  the  system  u  less 
controllable  and  therefore  not  desirable  even  thoujgh  the  above 
shifting  of  the  poles  is  edvantageous  for  vibration  reductioTL 

This  decrease  in  controllability  car.  be  easily  illustrated 
through  a  scalar  example  described  by 

x(t)  =  -ax(t)  +  bu(t).  (3.16) 


Using  Arbel's  method  for  a  large  final  time  T,  the  eigenvalue 
of  W  (0,7)  tends  to  which  decreases  u  a  increases,  and 

subsequently  the  controllability  measure  deems  the  system  as 
less  controllable  even  though  from  a  vibration  reduction  per¬ 
spective  it  is  more  desirable.  A  similar  effect  is  seen  in  flexible 
structures  with  large  decay  rates  in  the  higher  modes  [13]. 
Arbel’s  placement  method  would  therefore  push  the  placment 
towards  locations  where  the  higher  modes  have  maximum 
strain.  However,  due  to  lower  decay  rates,  control  of  the  lower 
modes  is  more  critical  from  a  vibration  suppression  perspec¬ 
tive.  Hius  for  vibration  reduction,  care  must  be  exercised 
when  using  the  controllability  grammian  based  measures. 

4.  Actuator  Plsccmeot  For  AFS 

In  this  section  we  formulate  the  problem  for  the  place¬ 
ment  of  distributed  actuators  in  a  general  articultted  flexible 
structure  (AFS).  The  set  of  equilibria  in  AFS  consists  of  rigid 
body  configurations  with  zero  elastic  deformations.  Our  control 
objective  is  to  obtain  uniformly  good  regulation  of  elastic 
def^ormations  at  any  equilibrium  state  (rigid  body 
configuration)  of  the  articulate  structure.  This  impbes  that  the 
set  of  desired  trajectories  Xe.  are  constant  trajectories.  Our 
approach  is  based  on  linearization  (Section  H  of  the  system 
equations  at  each  equilibrium  point  in  the  set  Xg  followed  1^  a 
local  cost  measure  (Section  3)  evaluation  at  these  equiUbrium 
points.  In  the  following  subsection  we  describe  this  pro¬ 
cedure. 


Problem  Statement 

A  local  cost  functional  c(x^,p)  is  defined  for  every 
placemem  p  and  rigid  body  configuration  Xo  of  the  articulated 
flexible  structure  u  follows.  For  the  passive  oamping  con¬ 
troller  (3.5)  the  local  cost  function  is  defined  as 

c(Xi,,p)  k  max  RcXi(A)  (4.la) 

and 

c(Jc*.P)  ^  zo  =  z(to).  (4.1b) 

for  the  LQR  controller  (3.9).  Based  on  cost  measures  evaluated 
through  local  linearization  (section  3)  we  can  state  the  place¬ 
ment  problem  as 

C(p)  (4.2) 

where  the  functional  Cip)  assigns  a  global  cost  for  each 

placement  by  evaluating  a  suitifole  norm  on  the  function 
c(-,p):Xt-*R^.  If  for  example,  die  set  of  e^librium 
poi^  Xa .  is  contained  in  /{ * ,  dim  two  possible  choices  of  the 
the  function  C  (  p  )  are  the  average  and  worst  case: 

\cixa,p)dx 

‘  and  (4.3) 

^ina^  c(Xa,p).  (4.4) 


These  measures  are  used  to  numerically  solve  die  problem  of 
pladng  distributed  Piezo-electric  actuators  in  a  two-link  flexi¬ 
ble  manipulator-strictive  in  the  next  section. 

Example 

Fbr  the  two-link  flexible  manipulator  (figure  1),  we 
study  die  distributed  actuator  placement  to  optimally  control 
small  structural  vibrations  when  the  joint  motors  are  effectively 
locked  because  of  static  friction.  This  corresponds  to  clamped 
boundary  conditions  at  the  joints.  Our  objective  being  an 
actuator  placement  in  die  truss  smicnire  to  achieve  optimal 
vibration  damping  in  the  elastic  modes  over  a  set  of  rigid  body 
equUibrium  configurations.  We  study  die  actuator  placement  at 
a  single  shoulder  joint  configuraticn  because  the  system 
dynamics  are  independent  of  the  shouldei  j^t  angle.  Also, 
the  elbow  joiiU  8  is  constrained  by  1 6 1  ^  90^  in  die  experi¬ 
ment.  Then  by  symmetry  it  is  sufficient  to  study  the  problem 
for  elbow  joint  angles  b^een  O’  and  90" .  Hence  die  set  of 
equilibrium  states  Xa,  containing  rigid  body  configurations 
with  zero  elastic  deflections,  is  parametrized  by  the  eltew  joint 
angle  6  and  is  independent  of  the  placement  p . 

The  two  measures  defined  in  the  above  sub-section  are  used  to 
generate  solutions  to  the  placement  problem  of  the  piezo¬ 
actuators.  The  passive  co-located  damping  controller  con¬ 
sidered  is  parametrized  by  a  scalar  feedback  gain  Ke  and  the 
feedback  law  can  be  written  as  [13] 

H=-Xrf[o,„  Cp.e]  Z  .  (4.5) 


While  computing  the  cost,  an  mtimization  is  also  performed 
over  th:.  ’-calar  controller  gain  This  implies  that  the  local 
measui .  l'  (/>,  Xp  ),  with  the  passive  damping  controllcT  is  given 
by 

c{p,Xa)  k  min  m8xReX,(A).  (4.6) 


For  the  LQR  controller  based  measure,  weighting  matrix  Q  is 
chosen  to  be  uniform  in  the  elastic  displacements  as 


(2  = 
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(4.7) 


The  choice  of  the  weight on  the  control  input  depends 
on  the  de-poling  limit  of  the  actuator  and  the  initial  conditions. 
For  a  given  R ,  and  ratimal  placement  pap  and  the  maximum 
input  voluge  Umax(R)  configuratiotu  is  found. 
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This  variation  of  Um„{R )  as  a  function  of  R  and  a  (the  bound 
on  the  initial  conditions  i.e.  Ij^I  is  shown  in  figure  4. 
Based  on  de-poling  limia  for  the  actuator,  the  maximum 
applied  voltage  is  limited  to  3(X)V  which  corresponds  to  an 
applied  field  of  2  MV/m.  For  OL  =  le—3,  corresponding  to  a 
maximum  transverse  elastic  deflections  of  approximately  3cm . 
^  =  le— 6  falls  in  the  suitable  region,  with  maximum  input 
voltages  below  the  actuator  de-poling  limit 

For  the  averaging  global  measure  defined  by  equation  (43),  the 
passive  damping  based  measure  yielded  a  placement  at  the  root 
of  the  first  luik,  while  the  secortd  case  of  LQR  based  measures 
resulted  in  a  placement  at  the  root  of  the  second  link  (figures 
8,9).  It  is  seen  in  these  figures  that  the  placements  obtain^  are 
relatively  "good"  over  all  joint  configurations. 

Discussion 

As  the  joint  angle  9  varies  from  0  to  90* ,  there  is  a  sub¬ 
stantial  variation  in  the  system  eigenvalues.  This  is  represented 
in  figure  5,  where  the  variation  of  the  lowest  modal  frequency 
with  elbow  joint  angle  is  shown.  This  significant  (40% 
increase)  change  in  the  modal  frequencies  might  deem  uni¬ 
formly  good  placements  impossible.  The  uniformity  of  place¬ 
ment  location  is  a  desirable  result  because  it  implies  that  a  sin¬ 
gle  placement  is  effective  in  reducing  vibrations  at  the  different 
configurations.  The  placement  based  on  passive  dampitu  con¬ 
troller  is  seen  to  be  invariant  with  joint  configuration.  the 
LQR  case  the  resultant  global  optimal  placement  is  not  the  best 
for  certain  configurations,  however  the  increase  in  cost  is  negli¬ 
gible  ( 1  %)  as  seen  in  figure  9a.  This  near  optimality  over  all 
configurations  is  attributed  to  the  relative  invariance  of  the 
eigenvector  components  associated  with  the  nodal  rotations  at 
different  joint  angles. 

The  placement  invariance  is  due  to  the  fact  that  in  our 
example  the  ei^vectors  are  relatively  invariant  with  the  joint 
configuration.  This  is  illustrated  in  figtne  6,  where  the  projec¬ 
tion  of  the  eigenvecton,  associated  with  the  least  two  eigen¬ 
values.  onto  the  rotary  elastic  displacements  at  the  FEM  nodes 
of  the  manipulator  model  is  showtL  These  correspond  to  the 
spatial  derivative  of  the  projection  of  the  eigenvectors  oiuo  the 
transverse  elastic  displacemenu  of  the  manipulatOT.  Note  that 
these  rotary  mode  shapes  shown  in  figure  6,  vary  little  with 
joint  angle.  As  the  placement  varies  over  the  length  of  the 
manipulator,  the  relaave  control  over  each  mode  and  its  varia¬ 
tion  are  determined  by  these  mode  shapes  [10].  Cons^entlv. 
the  invariance  of  these  mode  shapes  with  the  joiiu  an^e 
implies  the  relative  invariance  of  the  distribution  of  effective 
control  over  different  modes.  It  is  this  control  distribution  of 
the  different  modes  that  determines  the  cost  for  the  possible 
placement  and  therefore  its  invariance  resulta  in  a  uniformly 
good  placement. 

The  performance  measure  based  on  passive  damping  is 
determined  by  the  distance  of  system  eigenvalues  from  the 
imaginary  axis.  This  distance  is  an  estimate  of  the  sloweat 
decay  rate  of  elastic  perturbations  in  the  structure.  In  our  exam¬ 
ple,  the  decay  rate  tends  to  increase  with  the  elbow  joint-angle 
as  shown  in  figure  7.  This  is  maiidy  due  to  the  iicrease  in  the 
system  natural  frequencies,  but  with  constant  structural  dunp- 
ing.  For  such  a  system,  the  elastic  vibration  modes  associate 
with  higher  modal  frequencies  tend  to  have  higher  decay  rates. 
Thus  m  the  case  of  the  passive  coiuioller  based  performance 
measure,  it  is  the  variation  of  the  mode  associated  with  the 
lowest  frequency  that  determines  the  cost  and  therefore  the 
placement  To  control  a  single  eigenvector,  the  optimal  place¬ 
ment  is  at  the  location  of  maximum  strain  (18)  where  the  effec- 
uve  control  over  that  mode  is  maximized.  Hence  the  passive 
damping  based  measure  yields  the  placement  at  the  first  man  of 
the  structure.  As  discussed  above,  this  placement  is  uniformly 
good  for  all  joint  configurations  because  nodal  rotations  associ¬ 
ated  with  the  mode  shapes  are  relatively  invariam. 

The  LQR  based  performance  measure  resulted  in  a 
placement  where  the  qua^atic  cost  functional  given  by  equa¬ 
tion  (2.7)  is  minimized.  The  optiimzation  of  this  quadrabc  cost 
functional  results  in  a  compromise  placement  between  high 
decay  rates  of  perturbations  and  small  control  effort  Note  thiu 
the  placement  obtained  by  using  this  performance  measure  is 
relatively  good  over  all  configurations.  Thus  the  invariance  of 
the  mode  shapes  aided  in  placements  which  were  near-optimal 


The  reader  is  cautioned  that  this  invariance  in  the 
mode-shapes  need  not  be  true  for  all  AFS.  The  vanaiions  in  the 
mode  shapes  aivl  the  modal  frequencies  with  configuration  can 
in  general  affect  the  placement  Hence  it  is  not  always  possible 
to  find  placements  that  are  uniformly  good  over  different 
configuratior,s.  In  such  circumstances  the  placement  should  be 
based  on  either  the  average  jjerformance  or  worst  case  perfor¬ 
mance  described  in  section  2,  and  then  the  resulting  placement 
yields  the  best  achievable  performance. 

Preliminary  experimental  results  for  co-located  passive 
damping  based  controllers  are  given  in  figure  10.  where  we 
show  the  frequency  response  between  the  elastic  vibrations  at 
the  end  of  first  link  span  8  obtained  through  an  optical  sensor 
and  input  to  the  base  joint  actuator.  We  obtain  a  7db  improve¬ 
ment  in  vibration  suppression  at  the  first  pin- free  resonance. 

6.  Conclusioa 

In  this  paper  we  formulated  the  actuator  placement  for 
both  linear  and  articulated  flexible  structures.  It  was  shown 
that  the  controllability  grammian  based  placement  technique  is 
not  suitable  for  vibration  damping  firoblems.  For  general  arti¬ 
culated  structures,  two  closed  loop  performance  cntenons  were 
used  to  derive  objective  functions  for  optimum  placement  of 
actuators.  The  procedures,  developed  independent  of  initial 
conditions  and  formulated  as  eigenvalue  problems,  are  easy  to 
compute.  In  the  example  two-link  manipulator,  these  measures 
are  computationally  tractable  and  effective  to  solve  the  distri¬ 
buted  actuator  placement  problem  far  optimal  structural  vibra¬ 
tion  reduction.  An  important  observation  is  that  the  mode 
shapes  associated  with  nodal  rotations  for  the  example  two-link 
flexible  mani^ator  are  relatively  invariant  with  joint 
configuration.  This  invariance  resulied  in  a  placement  which  is 
effective  in  reducing  vibrations  over  all  the  different 
configurations. 

The  measures  suggested  in  this  paper  can  also  be  used  for 
actuator  placement  in  set  point  controllers  of  non-linear  sys¬ 
tems.  In  this  sense  the  approach  is  a  modest  aaempt  at  the 
larger  problem  of  non-linear  actuator  placement  The  existence 
and  compuubility  of  optimal  designs  for  general  trajectory 
tracking  in  rton-linear  systems  is  the  subject  of  further  work. 
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Figure  1.  Experimental  smart  flexible  strucmre 


Figure  2.  The  active  bay 


t  -8  :  FEM  Degrees  of  freedom 
M  :  Ettective  Piezo  Actuation 
9 :  Lumped  mass 


Figure  3.  Equivalent  FEM  representation 
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Figure  4.  Variation  of  first  modal  frequency  of  the  structure 
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Figure  5.  Variation  of  maximum  applied  voltage 
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SUMMARY 

technique  is  presented  for  solving  the  inverse  dynamics  of  flexible  planar  multibody  systems.  This 
technique  yields  the  non-causal  joint  efforts  (inverse  dynamics)  as  well  as  the  internal  states  (inverse 
kinematics)  that  produce  a  prescribed  nominal  trajectory  of  the  end  effector.  A  non-recursive  Lagrangian 
approach  is  used  in  formulating  the  equations  of  motion  as  well  as  in  solving  the  inverse  dynamics 
equations.  Contrary  to  the  recursive  method  previously  presented,  the  proposed  method  solves  the  inverse 
problem  in  a  systematic  and  direct  manner  for  both  open-chain  as  well  as  closed-chain  configurations. 
Numerical  simulation  shows  that  the  proposed  procedure  provides  an  excellent  tracking  of  the  desired  end 
effector  trajectory. 


1.  INTRODUCTION 

Accurate  positioning  and  vibration  minimization  of  flexible  multibody  systems  have  generated 
considerable  interest  from  the  computational  dynamics  and  controls  communities.  The  advent  of 
the  new  generation  of  very  fast,  lightweight  robots  and  flexible  articulated  space  structures  has 
made  the  control  of  structural  vibrations- an  important  practical  problem  in  the  manufacturing 
and  space  industries,  respectively. 

There  is  a  large  body  of  literature  dealing  with  the  forward  dynamic  analysis  of  flexible 
multibody  systems,  i.e.  the  determination  of  the  resulting  motion  when  the  joint  forces  and 
external  forces  are  given.  Several  authors*  have  proposed  the  use  of  floating  reference  frames, 
while  others**  have  put  forward  the  use  of  inertial  reference  frames.  Winfrey*  proposed  the 
superposition  of  linear  deflection  of  flexible  bodies  to  the  non-linear  rigid  body  motion.  Bahgat 
and  Willmert^  presented  a  finite  element  approach  for  vibration  analysis  of  planar  mechanisms 
using  the  assumption  that  rigid  body  motion  is  determined  by  rigid  body  kinematic  analysis  and 
the  elastic  response  is  driven  by  the  inertial  forces  generated  by  the  rigid  body  motion.  Likins^ 
considered  the  coupling  effects  of  rigid  body  motion  and  elastic  deformation  in  the  analysis  of 
tree-structured  flexible  systems.  De  Veubeke*  proposed  the  use  of  quasi-co-ordinates  and  a  mean 
axis  co-ordinate  system  to  simplify  the  equations  of  motioa  Song  and  Haug’  proposed  the  use  of 
centre-of-mass  co-ordinates  and  elastic  deflections  as  generalized  co-ordinates  for  the  analysis  of 
planar  mechanisms  compose^  of  beam  elements.  Shabana  and  Wehage*  used  the  generalized 
co-ordinate  partitioning  method  in  the  analysis  of  inertia-variant  flexible  mechanical  systems. 
Sunada  and  Dubowsky^  used  four-by-four  matrix  methods  and  component  mode  synthesis  to 
analyse  three-dimensional  flexible  robots.  Agrawal  and  .’vhabana^  investigated  the  dynamic 
characteristics  of  general  inertia-variant  flexible  multibody  systems  using  Euler  parameters  for 
rigid  body  co-ordinates  and  different  finite  element  mass  formulations.  Kim  and  Haug’  extended 
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the  recursive  formulations  for  the  dynamics  of  rigid  multibodies  to  open-chain  and  closed-chain 
flexible  multibody  systems.  Vukasovic  et  al.'°  presented  the  use  of  fully  Cartesian  co-ordinates  in 
the  analysis  of  flexible  multibody  systems.  The  etficiency  of  this  approach  lies  in  the  fact  that  only 
the  coupling  terms  between  rigid  and  elastic  co-ordinates  become  time-variant. 

Different  approaches  which  use  the  inertial  frame  of  reference  to  describe  the  large  overall 
motion  of  flexible  beams  have  been  proposed  by  Simo  and  Vu-Quoc,“  Cardona  and  Geradin*’ 
and  Avello  et  among  others.  The  use  of  an  inertial  frame  of  reference  leads  to  linear, 
uncoupled  inertia  terms  in  the  exp'-ession  for  kinetic  energy,  while  the  expression  for  the  potential 
energy  functional  becomes  non-linear.  Essential  to  this  type  of  formulation  is  the  use  of  finite 
strain  and  finite  rotation  theories  that  are  capable  of  treating  large  deformations  and  large 
rotations.  This  approach  has  the  advantage  that  it  captures  the  non-linear  stiffening  effects  that 
become  important  at  large  speeds  of  operation.  However,  it  has  the  disadvantage  of  having 
a  more  involved  implementation  than  the  formulation  based  on  the  use  of  floating  reference 
frames. 

On  the  other  hand,  numerous  control  approaches  have  also  been  proposed  for  the  position 
control  of  flexible  multibody  systems.  An  early  work  was  that  of  Cannon  and  Schmitz.’*  who 
presented  an  optimal  linear  quadratic  technique  to  control  the  tip  trajectory  of  a  single-link 
flexible  robot  arm.  Singh  and  Schy‘*  proposed  a  joint  space  close-  loop  control  for  elastic  robots 
by  applying  a  causal  non-linear  inversion  and  modal  damping.  Siciliano  and  Book’*  presented 
a  singular  perturbation  approach  to  identify  reduced-order  systems  used  to  obtain  a  collocated 
control  scheme.  Pfeiffer’^  suggested  a  multistage  control  strategy  consisting  of  a  feedforward 
based  on  rigid  body  inverse  dynamics,  and  a  stabilizing  feedback  on  the  linearized  system  around 
the  rigid  trajectory.  De  Luca  et  ai.^^  proposed  a  closed-loop  control  scheme  consisting  of 
a  model-based  feedforward  term  and  a  linear  feedback  on  joint  angles.  Oakley  and  Cannon” 
implemented  a  multilink  arm  controller  based  on  the  LQG  design  of  the  linearized  arm. 

Looking  at  the  vibration  minimization  problem  from  another  perspective.  Bayo^°  presented 
the  solution  of  the  inverse  dynamics  of  a  single-link  flexible  arm  in  the  frequency  domain.  The 
inverse  dynamics  yields  a  non-causal  or  time-delayed  joint  torque  (applieo  in  negative  tme  and 
future  time)  that  is  capable  of  positioning  the  end  effector  according  to  a  desired  trajectory.  Bayo 
and  Moulin^’  extended  the  inverse  dynamics  to  the  time  domain  by  making  use  of  a  bilateral 
convolution  integral.  Essential  to  the  inverse  dynamics  of  elastic  multibodies  is  the  realization 
that  the  joint  efforts  start  actuating  before  'he  end  effector  or  control  point  does.  Consequently, 
for  a  small  amount  of  time  the  joint  forces  do  not  cause  any  tip  motion.  This  effect  is  called 
non-causality  and  the  inverse  dynamics  has  to  account  for  it.  This  also  constitutes  a  difference  of 
significant  importance  between  the  rigid  and  flexible  multibodies  because  in  the  rigid  case  the 
inverse  dynamics  is  causal  (instant  response),  wher”  *  .t  is  non-causal  (time-anticipatory  re¬ 
sponse)  in  the  flexible  case.  As  shown  by  Bayo  and  Moulin.^’  the  causal  integration  of  the  inverse 
dynamics  of  the  flexible  case  leads  to  unstable  results.  Kwon  and  Book^^  also  proposed  a  solution 
for  the  inverse  problem  for  a  single-link  fle,  ibie  arm  by  dividing  the  inverse  system  into  a  causal 
part  and  an  anticausal  part  to  calculate  the  joint  torque  and  state  variables  in  the  time  domain  for 
a  given  end  effector  trajectory.  Bayo  et  at.^^  solved  the  multilink  case  uring  a  recursive  procedure 
suitable  for  open-chain  configurations.  The  importance  of  using  the  inverse  dynamics  approach 
to  vibration  control  has  been  demonstrated  recently  by  Paden  et  al.^*  who  have  used  passive 
feedback  and  feedforward  of  the  inverse  dynamics  torque  to  achieve  an  exponentially  stable 
tracking  control  law  that  yields  excellent  end-point  tracking  of  flexible  multibody  systems. 

Recursive  methods  are  limited  to  open-chain  configurations,  since  for  closed-chain  systems 
they  require  ad  hoc  procedures^”  that  strongly  depend  on  the  given  configurations.  This  limita¬ 
tion  motivated  the  research  described  in  this  paper  in  which  we  present  a  non-recursive 


FLEXIBLE  MULTIBODY  SYSTEMS:  THE  PLANAR  CASE 


2727 


Lagrangian  approach  for  the  solution  of  the  non-causal  (time-anticipatory)  inverse  dynamics  of 
flexible  multibody  systems.  This  method  provides  a  systematic  way  of  generating  and  solving  the 
inverse  problem  for  open-chain  as  well  as  closed-chain  systems  in  either  the  frequency  or  the  time 
domain.  In  the  next  sections,  the  equations  of  motion  are  formulated  and  a  solution  procedure  for 
the  inverse  dynamics  of  general  planar  flexible  multibody  systems  is  proposed.  Simulation  results 
for  open-chain  and  closed-chain  configurations  are  presented  to  illustrate  the  accuracy  of  the 
method. 


2.  MATHEMATICAL  FORMULATION 


In  this  paper,  the  floating  frame  of  reference  is  used  to  represent  the  kinematics  of  the  deformable 
bodies  comprising  the  planar  multibody  system.  Hence,  the  configuration  of  a  typical  component 
of  a  planar  multibody  shown  in  Figure  1  can  be  desenbed  by  two  sets  of  co-ordinates:  the  first  set 
corresponds  to  the  rigid  body  co-ordinates  representing  the  location  and  orientation  of  the  body 
axes  with  respect  to  the  inertial  frame;  the  second  set  corresponds  to  the  so-called  deformation 
co-ordinates  or  nodal  deformations  representing  the  deformation  of  the  body  with  respect  to  the 
body  axes.  Using  the  aforementioned  choice  of  co-ordinates,  the  location  of  an  arbitrary  point 
P  in  a  planar  deformable  body  i  is  given  by^’ 

r'  =  R‘-(-A'u‘  (1) 


where  R‘  is  the  location  of  the  origin  of  the  body  axes  with  respect  to  the  inertial  frame.  u‘  is  the 
location  of  point  P  with  respect  to  the  body  axes,  and  A'  is  the  rotation  transformation  matrix 
from  the  body  axes  to  the  inertial  frame.  In  the  planar  case,  the  transformation  matrix  is  given  by 


cos9‘ 

sind‘ 


-  sin  6' 
cos  d‘ 


(2) 


where  d'  is  the  angle  of  rotation  of  the  body  axes  with  respect  to  the  inertial  frame.  The  vector  u‘ 
can  be  decomposed  into 

u‘  =:  <  -I-  Uf  (3) 


Figure  I.  Reference  co-ordinates  for  a  planar  body 
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where  uf  is  the  position  vector  of  point  P  in  the  undeformed  state  with  respect  to  the  body  axes, 
and  Uf  is  the  deformation  vector  of  point  P  with  respect  to  the  body  axes.  The  deformation  vector 
Uf  can  be  expressed  in  terms  of  the  nodal  deformations  by  using  a  finite  element  discretization 
scheme 

Uf  =  N‘qr  (4) 

where  N'  is  the  shape  function  matrix  and  qj  is  the  nodal  deformation  vector. 

When  reference  co-ordinates  such  as  those  described  above  are  employed  in  multibody 
systems,  the  system  is  then  represented  in  terms  of  the  co-ordinates  q’^  =  [R,  0,  q,].  These 
co-ordinates  are  not  independent  because  the  motion  of  specific  points  in  different  bodies  are 
related  according  to  the  type  of  mechanical  joint  that  interconnects  them.  Moreover,  in  flexible 
mechanical  systems  the  deformation  of  a  component  affects  the  configuration  of  adjacent 
components.  As  a  consequence,  the  interdependence  of  the  generalized  co-ordinates  is  expressed 
by  a  vector  of  kinematic  constraint  equations,  such  as 

Olq,  f)  =  0  (5) 

where  q  is  the  total  vector  of  system  generalized  co-ordinates,  t  is  time,  and  <l>  is  the  vector  of 
linearly  independent  holonomic  constraint  equations.  These  constraint  equations  can  be  further 
classified  into:  (1)  rigid  body  constraints  where  only  rigid  body  variables  are  involved  in  the 
constraint  equation;  (2)  joint  constraints  where  both  rigid  body  and  deformation  co-ordinates  are 
included  in  the  constraint  equation:  (3)  time-dependent  constraints  wherein  the  constraint 
equations  can  be  explicit  functions  of  time  as  well  as  generalized  co-ordinates,  as  in  the  case  of 
imposing  the  co-ordinates  of  the  end-effector  to  follow  a  desired  trajectory.  To  illustrate  the 
construction  of  constraint  equations,  take  the  case  of  a  revolute  joint  which  connects  two  flexible 
planar  bodies  i  and  j  at  points  P  and  Q  shown  in  Figure  2.  The  two  constraint  equations 
corresponding  to  the  constraint  condition  that  requires  points  P  and  Q  to  be  coincident  can  be 
written  as 

(R' +  A‘ui.)-(R^  + A^u4)  =  0  (6) 

We  note  that  the  constraint  equation  exemplified  by  equation  (6)  forms  a  set  of  coupled 
non-linear  algebraic  equations  in  the  rigid  body  co-ordinates  and  deformation  co-ordinates. 


Figure  2.  A  pair  of  flexible  planar  bodies 
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Considering  the  rigid  body  and  deformation  co-ordinates  described  above  as  generalized 
co-ordinates,  and  following  standard  procedures  in  multibody  dynamics,  the  constrained  equa¬ 
tions  of  motion  become*’ 

.VI(q)q  -F  Cq  -t-  Kq  -H  Ojx  =  Q,  +  Q,(q,  q)  (7) 

where  M,  C  and  K  are  the  system  mass,  damping  and  stiffness  matrices,  respectively,  a  is  the 
vector  of  Lagrange  multipliers  associated  with  the  constraints,  <1>,  is  the  constraint  Jacobian 
matnx,  Q,.  is  the  vector  of  applied  external  forces,  and  Q,  is  the  quadratic  velocity  vector.  The 
quadratic  velocity  vector  contains  the  centrifugal  forces  and  Coriolis  forces  that  result  from  the 
differentiation  of  the  kinetic  energy  expression  with  respect  to  the  generalized  co-ordinates. 


2.1.  Forward  dynamics 

In  a  forward  dynamic  analysis,  i.e.  finding  the  resulting  motion  given  the  applied  joint  forces 
and  external  forces,  equations  (5)  and  (7)  form  a  mixed  system  of  differential-algebraic  equations 
that  have  to  be  solved  simultaneously.  As  explained  in  the  next  section,  the  solution  to  the  inverse 
dynamics  problem  requires  a  forward  dynamic  analysis  within  an  iteration  process.  We  solve  the 
forward  dynamics  problem  by  using  the  augmented  Lagrangian  penalty  formulation.^*  Applying 
the  augmented  Lagrangian  penalty  formulation  to  equations  (5)  and  (7)  results  in  the  following 
equation: 

M(q)q  -F  Cq  -(-  Kq  -t-  +  2)Ui>(i>  -F  =  Qe  +  Q,(q,  q)  -  (8) 

where  *  is  a  diagonal  matrix  of  penalty  factors  whose  elements  are  large  real  numbers  that  will 
assure  the  satisfaction  of  constraints,  and  to  and  }i  are  diagonal  matrices  representing  the  natural 
frequencies  and  damping  characteristics  of  the  dynamic  penalty  system  associated  with  the 
constraints.  Values  of  a  in  the  range  |0’  <  a  <  10*  provide  excellent  results  when  working  in 
double  precision.  The  augmented  Lagrangian  method  requires  an  iteration  for  the  correct  value 
of  the  Lagrange  multipliers.  The  iterative  equation  for  the  Lagrange  multipliers  is  given  by 

>.**  1  -F  Xf  -f  «[^  -F  2|ia><i>  -F  (9) 

The  Iterative  process  described  by  equation  (9)  involves  only  a  few  additional  operations  during 
each  iteration  but  it  significantly  improves  the  convergence  of  the  forward  dynamics  solution  as 
compared  to  the  standard  penalty  method.^* 

The  augmented  Lagrangian  penalty  formulation  has  several  advantages  over  the  standard 
algorithms  used  in  solving  differential-algebraic  equations.  First,  the  method  obviates  the  need  to 
solve  a  mixed  set  of  differential-algebraic  equations  and  does  not  increase  the  number  of 
equations  to  account  for  the  constraints.  Second,  this  method  allows  the  use  of  standard 
unconditionally  stable  algorithms  without  the  need  of  further  stabilization  techniques  to  control 
the  violation  of  constraints  during  the  integration  process.  Third,  the  method  can  handle 
redundant  constraints  and  allows  the  multibody  system  to  undergo  singular  positions.  Fourth, 
the  constraint  forces  (Lagrange  multipliers)  can  be  obtained  as  a  by-product  of  the  integration 
without  having  to  integrate  additional  equations.  Finally,  the  method  assures  convergence 
independent  of  the  penalty  values  used. 


2.2.  Inverse  kinematics  and  inverse  dynamics 

In  the  context  of  end-point  motion  and  vibration  control,  inverse  dynamics  refers  to  the 
problem  of  finding  the  actuating  forces  or  torques  that  will  cause  the  end  point  of  a  flexible 
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multibody  system  to  follow  a  desired  trajectory.  Moulin  and  Bayo^’  showed  that  because  of  the 
non-minimum-phase  character  of  the  inverse  problem,  the  unique  stable  solution  is  found  to  be 
non-causal.  i.e.  actuation  in  required  before  the  end-point  has  started  to  move  as  well  as  after  the 
end-point  has  stopped.  These  findings  have  led  to  new  theoretical  discoveries  in  the  inversion  of 
non-linear  non-minimum-phase  systems  such  as  fle.xible  multibodies.'*' In  addition,  the  fact 
that  the  stable  solution  starts  at  negative  time  and  extends  into  future  time  precludes  the  standard 
time-domain  integration  schemes  currently  available  in  multibody  computer  codes  from  obtain¬ 
ing  the  proper  inverse  solution.  These  codes  will  yield  causal,  and  hence  unstable,  results  and 
therefore  are  valid  only  for  the  forward  dynamics.  The  integration  process  is  therefore  essential  to 
obtain  non-causal  solutions,  and  the  time-anticipatory  effect  can  be  obtained  by  integrating  in  the 
frequency  domain  or  in  the  time  domain  using  the  bilateral  Laplace  transform. 

It  is  important  to  note  that  when  the  dynamic  effects  of  the  elastic  modes  are  small  (quasi-rigid 
cases),  causal  inverse  solutions  may  be  obtained  by  regularizing  the  problem  with  the  addition  of 
artificial  damping  either  through  the  damping  matrix  or  the  numerical  integration  scheme. 
However,  this  ad  hoc  process  changes  the  nature  of  the  problem  and  does  not  yield  the  desired 
time  delay  effect. 

Previous  solutions^ to  the  inverse  problem  relied  on  a  pinned-free  finite  element  model  of 
a  flexible  beam,  and  the  equation  for  the  inverse  dynamics  torque  was  formulated  by  imposing  the 
condition  that  the  transverse  deformation  of  the  free  end  of  each  link  be  zero  throughout  the 
motion.  This  type  of  model  led  to  a  recursive  scheme  to  solve  the  inverse  dynamics  of  multilink 
flexible  manipulators.  This  recursive  procedure  is  suitable  for  open-chain  but  not  for  closed-chain 
configurations. 

In  this  section,  we  describe  a  non-recursive  Lagrangian  approach  to  solve  the  general  planar 
inverse  dynamics  problem.  Compared  to  the  recursive  procedure,  this  non-recursive  Lagrangian 
approach  is  more  systematic  and  becomes  the  only  choice  when  closed-chain  systems  are 
encountered.  We  model  the  elastic  links  under  pinned-pinned  boundary  conditions.  This  allows 
us  to  express  the  end  effector  trajectory  in  terms  of  the  rigid  body  co-ordinates  only  and,  in 
addition,  leads  to  a  simplified  form  of  the  inverse  kinematics  equations  for  the  internal  states. 
Once  the  correct  internal  states  are  known,  the  equations  of  motion  give  an  explicit  expression  for 
the  inverse  dynamics  torque. 

In  partitioned  form,  equation  (7)  can  be  wntten  as 
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The  second  set  of  equations  in  equation  (10)  can  be  rearranged  to  express  the  externally  applied 
joint  forces  as 

Qc«  =  mgj(R  -(-  +  m^tif  +  —  Q,*  (11) 

Equation  (11)  constitutes  the  inverse  dynamics  equation  that  yields  the  joint  forces  (torques) 
necessary  for  the  end-point  to  follow  a  prescribed  trajectory.  In  order  to  obtain  Qj«,  the  nodal 
acceleration  vector  <jf  is  needed.  This  vector  can  be  obtained  from  the  third  set  of  equations  in 
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equation  (10),  which  can  be  written  as 

nifftif  +  Cffqr  +  kffq,  =  Q.f  +  Q,,  -  <Dj,X.  -  mf^R  -  tnf«e  (12) 

The  vector  of  applied  nodal  forces  Q^f  can  be  expressed  in  terms  of  the  externally  applied  torques 
through  the  following  mapping: 

Qef  =  GfQe»  (13) 

where  in  the  planar  case,  the  matrix  Gf  is  a  constant  Boolean  matrix  which  maps  the  externally 
applied  torques  to  the  vector  of  externally  applied  nodal  forces.  For  example,  in  the  open-chain 
planar  multibody  system  shown  in  Figure  3,  the  Boolean  matrix  Gf  is  constructed  such  that  the 
external  moment  on  the  node  located  at  the  base  of  the  first  link  is  equal  to  the  base  motor  torque, 
the  moment  on  the  node  located  at  the  tip  of  the  first  link  is  the  negative  of  the  elbow  motor 
torque,  the  external  moment  on  the  node  located  at  the  base  of  the  second  link  is  equal  to  the 
elbow  motor  torque,  and  all  other  external  forces  are  zero.  The  same  technique  can  be  applied  for 
the  closed-chain  multibody  system  shown  in  Figure  9,  where  the  Boolean  matrix  Gf  is  construc¬ 
ted  such  that  the  externally  applied  nodal  forces  are  equal  to  the  motor  torques  at  the  rotational 
degree  of  freedom  of  the  nodes  where  the  motors  are  located  and  zero  elsewhere.  Substituting 
equations  (1 1)  and  (13)  into  equation  (12)  results  in 

mffqr  +  Cffqf  4-  kf^j,  =  Gfin^flif  -i-  F,(x,  q„  q„  q„  qf,  qf)  (14) 

where  Ft  is  a  force  vector  that  includes  the  inertial  terms,  reaction  terms  between  contiguous 
bodies  and  quadratic  velocity  terms. 

The  problem  statement  for  the  inverse  kinematics  is  that  of  finding  the  non-causal  internal 
states  qf  so  that  the  end-point  co-ordinates  characterized  by  a  subset  of  the  rigid  body  co¬ 
ordinates  q,  follow  a  prescribed  trajectory.  The  inverse  kinematic  equations  of  equation  (14)  are 
non-linear  in  the  variable  qf.  The  non-linear  non-causal  inversion  cannot  be  carried  out  by 
standard  numerical  integration  of  ODEs.  It  requires  a  linearization  process  in  either  the 
frequency  domain  or  the  time  domain,  or  splitting  the  linearized  system  into  its  causal  and 
anticausal  components. 

The  key  to  the  linearization  process  for  the  non-recursive  approach  relies  on  decomposing  the 
inertial  coupling  submatrix  m^f  into  the  sum  of  a  time-invariant  matrix  and  a  time-varying  matrix 

mgf  ==  m|f  4- (15) 


Figure  3.  Open-chain  flexible  multibody  system 
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where  m|,  and  are  the  time-invariant  part  and  time-varyng  part  of  m»f,  respectively.  This 
decomposition  is  essential  for  the  iteration  process  needed  to  obtain  the  non-causal  solution  to 
the  non-linear  inversion  problem.  Substituting  equation  (15)  into  Equation  (14).  we  obtain  the 
inverse  kinematics  equation  of  motion  for  the  nodal  displacements  q,: 

+  Cffqr  +  Mf  =  qr.  qr.  qr.  qr.  qr.  qr)  (16) 

where 

ni^  =  niff  —  Gf  mlf  ( 1 7) 


The  mass  matrix  mff  is  non-symmetric  and  it  is  precisely  the  non-symmetry  of  the  mass  matrix 
that  produces  internal  states  which  are  non-causal  with  respect  to  the  end-point  motion  when 
non-causal  techniques  are  employed  to  obtain  the  proper  inversion  of  the  non-linear,  non¬ 
minimum  problem  characterized  by  equation  (16).  The  non-linear  inversion  can  now  be  carried 
out  efficiently  in  the  frequency  domain,  since  the  leading  matrices  have  been  constructed  such  that 
they  remain  constant  throughout  the  motion.  We  thus  solve  equation  (16)  iteratively  in  the 
frequency  domain  to  yield  the  nodal  deformation  vector  qf  that  is  non-causal  with  respect  to  the 
end-point  motion. 

In  the  frequency  domain,  equation  (16)  can  be  written  as  a  set  of  complex  equations  for 
a  particular  frequency  cu 


+  ~  Cff  —  -z-j  kff 
10)  O) 


qricD)  =  F(d)) 


(18) 


where  qf(a))  is  the  Fourier  transform  of  qf(t)  and  F(a))  is  the  Fourier  transform  of  F(r).  Equation 
( 18)  is  based  on  the  assumption  that  qt(r)  and  F(t)  are  Fourier-transformable.  This  assumption  is 
valid  for  slewing  motions  which  are  from  rest  to  rest.  The  nodal  acceleration  vector  qffd))  can  be 
obtained  directly  from  equation  (18)  for  each  frequency  d>.  The  leading  matrix  of  equation  (18)  is 
a  complex  regular  matrix  that  is  invertible  for  all  frequencies  except  d>  =  0.  However,  for  w  =  0. 
the  system  undergoes  a  rigid  body  motion  determined  only  by  the  invertible  mass  matrix  m'f. 
However,  we  note  that  the  forcing  vector  on  the  right-hand  side  of  equation  (18)  depends  on  the 
nodal  deformations,  velocities  and  accelerations.  We  use  a  successive  substitution  scheme  to 
iterate  for  the  nodal  deformations,  velocities  and  accelerations.  Finally,  the  nodal  accelerations  in 
the  time  domain  may  be  obtained  through  the  application  of  the  inverse  Fourier  transform,  i.e. 


qr(w)e‘"'  do) 

“  * 


(19) 


Once  the  non-causal  nodal  accelerations  arc  known,  equation  (11)  can  be  used  to  compute 
explicitly  the  non-causal  inverse  dynamics  joint  efforts  that  will  move  the  end  effector  according 
to  a  desired  trajectory.  We  note,  however,  that  the  inverse  dynamics  torque  and  internal  states 
given  by  equations  (1 1)  and  (16),  respectively,  depend  on  the  Lagrange  multipliers  and  rigid  body 
co-ordinates,  which  in  turn  depend  on  the  internal  states  and  the  applied  torque.  Moreover,  the 
rigid  body  co-ordinates  and  Lagrange  multipliers  are  different  from  their  nominal  values  when 
the  components  of  the  multibody  system  are  flexible.  Therefore,  a  forward  dynamic  analysis  is 
required  to  obtain  an  improved  estimate  of  the  generalized  co-ordinates  and  Lagrange  multi¬ 
pliers.  In  order  to  ensure  that  the  iteration  process  converges  to  obtain  the  joint  efforts  that  will 
cause  the  end-effector  to  follow  the  desired  trajectory,  the  forward  dynamics  analysis  is  carried 
out  with  the  additional  constraint  that  the  co-ordinates  of  the  end-point  follow  the  desired 
trajectory.  These  additional  constraints  have  corresponding  Lagrange  multipliers  which  act  as 
correcting  terms  to  the  joint  efforts  that  have  previously  been  calculated. 
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It  is  important  to  note  that  the  computation  of  the  nodal  acceleration  vector  in  each  iteration 
can  also  be  carried  out  in  the  time  domain  through  the  use  of  the  bilateral  Laplace  transform 


qf(f)  = 


^  h,(t  -  r)/(T)dr 

-  X.  i  s  I 


(20) 


where  h^t)  is  the  acceleration  response  vector  to  an  impulse  applied  to  the  ith  degree  of  freedom 
and  f,U)  is  the  ith  component  of  the  forcing  term  on  the  right-hand  side  of  equation  (16).  We  note 
that  the  integration  from  -  x  to  x  is  necessary  to  capture  the  non-causal  effects. 

To  summarize,  the  procedure  for  obtaining  the  inverse  dynamics  solution  for  flexible  multi¬ 
body  systems  involves  the  following  steps: 


Algorithm 

1.  Perform  a  rigid  body  inverse  dynamic  analysis  to  obtain  the  nominal  values  of  the  rigid 
body  co-ordinates  q,  and  Lagrange  multipliers  X. 

2.  Solve  the  inverse  kinematics  equation  in  the  frequency  domain  through  equation  (16)  or  in 
the  time  domain  through  equation  (20)  to  obtain  the  time-delayed  nodal  accelerations  (jf. 

3.  Compute  the  inverse  dynamics  joint  efforts  Q^g  using  equation  (11). 

4.  Perform  a  forward  dynamic  analysis  using  equation  (8)  to  obtain  new  values  for  the 
generalized  co-ordinates  and  Lagrange  multipliers. 

5.  Repeat  steps  2-4  until  convergence  in  the  inverse  dynamics  torques  is  achieved. 

It  is  worthwhile  to  compare  the  recursive  procedure^^  and  the  non-recursive  Lagrangian 
procedure  for  the  inverse  dynamics  of  multibody  systems.  In  the  former  method,  each  body  in  the 
multibody  system  is  analysed  sequentially  starting  from  the  last  element  in  the  chain.  For  each 
element,  the  Joint  torques  are  determined  first  under  the  assumption  that  the  rigid  body 
co-ordinates  are  moving  according  to  the  nominal  trajectory.  With  the  joint  actuation  known  for 
this  component,  a  forward  dynamic  analysis  is  casrried  out  to  determine  the  nodal  deformations, 
and  the  reaction  forces  from  the  next  element  in  the  chain  are  subsequently  determined  from 
equilibrium  considerations.  This  recursive  method  works  very  well  for  open-chain  systems,  but  it 
is  not  suitable  for  closed-chain  systems  because  it  requires  the  analyst  to  cut  the  loop  of  the  closed 
chain  and  account  for  the  reaction  forces  at  the  cuts  through  ad  hoc  procedures.  The  non¬ 
recursive  Lagrangian  method  avoids  this  problem  since  the  reaction  between  system  components 
are  automatically  accounted  for  by  the  Lagrange  multipliers  and  no  distinction  is  made  between 
open-chain  and  closed-chain  configurations.  The  non-recursive  procedure  is  thus  more  system¬ 
atic  and  general. 


3.  SIMULATION  RESULTS 

In  this  section  we  present  the  results  of  numerical  simulations  that  verify  the  procedure  discussed 
above.  First,  we  apply  the  proposed  non-recursive  Lagrangian  approach  to  an  open-chain  flexible 
multibody  system  and  compare  the  results  with  those  obtained  by  the  recursive  method^  ^  to  test 
the  validity  of  the  proposed  procedure.  Next,  we  present  the  results  of  the  application  of  the 
non-recursive  Lagrangian  approach  to  a  closed-chain  flexible  multibody  system  to  determine  the 
inverse  dynamics  torque  that  will  produce  the  desired  motion  at  the  end  elTector. 

S.i.  Open-chain  multibody  system 

Figure  3  shows  a  two-link  flexible  multibody  system  in  the  horizontal  plane.  The  end-point  of 
the  second  link  is  specified  to  move  along  the  x-axis  according  to  the  acceleration  profile 
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Figure  4.  End-point  acceleration  along  the  .x-axis 


described  by  Figure  4,  which  corresponds  to  an  end-point  displacement  of  0483  meters  along  the 
.t-axis.  The  geometric  and  material  properties  of  the  links  arc  as  follows: 

First  link: 

Length  0-66  m 

Cross-sectional  area  1-2  x  10“‘  m^ 

Cross-sectional  second  moment  of  area  2-3  x  10"  m** 

Second  link: 

Length:  0-66  m 

Cross-sectional  area  40  x  10" ’ m^ 

Cross-sectional  second  moment  of  area  8-5  x  10" m* 

The  two  links  share  the  following  properties: 

Young’s  modulus  14  GPa 
Mass  density  2715  kg/m^ 

In  Figure  5,  the  inverse  dynamics  torque  profile  for  the  base  motor  using  the  non-recursive 
method  is  superimposed  on  the  inverse  dynamics  torque  profile  determined  by  the  recursive 
method.  The  inverse  dynamics  torque  profiles  for  the  elbow  motor  computed  by  the  two 
aforementioned  methods  are  superimposed  in  Figure  6.  Both  the  recursive  and  non-recursive 
formulations  yield  the  same  result  and  can  be  superimposed  on  each  other  (solid  curve),  thus 
validating  the  proposeti  method.  The  corresponding  rigid  body  torques  are  also  shown  as  dashed 
curves  in  Figures  5  and  6  to  illustrate  the  pre-actuation  and  post-actuation  present  in  the  inverse 
dynamics  flexible  torque  profiles. 

Figure  7  shows  comparison  of  the  elastic  angular  rotation  at  the  base  of  the  second  link 
obtained  by  a  feedforward  of  the  inverse  dynamics  torque  (solid  curve)  to  that  obtained  by 
a  feedforward  of  the  rigid  body  torque  (dashed  curve).  We  observe  that  while  the  inverse 
dynamics  torque  does  not  induce  residual  vibration,  the  rigid  body  torque  induces  substantial 
residual  oscillation. 
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Figure  5.  Base  motor  inverse  dynamics  ( - )  and  rigid  ( . )  torques 
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Figure  6.  Elbow  motor  inverse  dynamics  ( - )  and  rigid  { . )  torques 

Figure  8  shows  a  comparison  of  the  vertical  tip  error  obtained  by  a  feedforward  of  the  inverse 
dynamics  torque  obtained  using  the  non-recursive  approach  (solid  curve)  and  the  vertical  tip 
error  obtained  by  a  feedforward  of  the  rigid  body  torque  (dashed  curve).  Wc  observe  that  while 
the  inverse  dynamics  torque  jprovides  an  excellent  tracking  of  the  tip  trajectory,  the  rigid  body 
torque  induces  a  large  oscillation  in  the  tip  motion. 

3.2.  Closed-chain  multibody  system 

Figure  9  shows  a  closed-chain  flexible  multibody  system  made  up  of  four  flexible  links  with  two 
joints  which  are  fixed  against  translation  relative  to  the  ground.  As  in  the  open-chain  case,  the 
multibody  system  is  assumed  to  lie  on  a  horizontal  plane  so  that  gravity  effects  are  neglected.  The 
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Figure  8.  Vertical  tip  error;  inverse  dynamics  ( - )  vs.  rigid  ( . )  torques 


Figure  9.  Closed-chain  flexible  multibody  system 
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desired  trajectory  of  joint  5  is  a  straight  line  at  45'’  with  respect  to  the  x-  and  y-axis.  The  x-  and 
y-components  of  the  acceleration  of  joint  5  are  specified  to  follow  the  acceleration  profile  shown 
in  Figure  10.  The  four  links  share  the  following  geometric  and  material  properties: 

Length  0  60  m 

Cross-sectional  area  40xl0'’m“ 

Cross-sectional  second  moment  of  area  8  5  x  10"  **  m* 

Young's  modulus  14GPa 
Mass  density  2715  kg,  m^ 

Figure  1 1  shows  the  inverse  dynamics  torque  profile  at  joint  1  obtained  by  the  non-recursive 
method  (solid  curve).  The  rigid  body  inverse  dynamics  torque  profile  is  superimposed  for 
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Figure  10.  End-pomt  acceleration  along  the  x-  and  >-axis 
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Figure  11  Joint  1  inverse  dynamics  ( - )  and  rigid  ( 
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comparison  (dashed  curve).  The  fig  ire  shows  the  non-causal  (time-anticipatory)  character  of  the 
solution  for  the  inverse  problem.  Figure  12  shows  the  inverse  dynamic  torque  profile  (solid  curve) 
at  joint  3  superimposed  with  the  corresponding  rigid  body  torque  profile  (dashed  curve).  Again, 
the  time  delay  due  to  the  non-causality  of  the  solution  is  seen  in  this  figure. 

Figure  13  shows  the  elastic  angular  rotation  at  the  base  of  the  first  link  obtained  by  a  feedfor¬ 
ward  of  the  inverse  dynamics  torque  (solid  curve).  Superimpossed  in  the  same  figure  is  the 
corresponding  elastic  angular  rotation  obtained  by  a  feedforward  of  the  rigid  body  torque 
(dashed  curve).  Whereas  the  rigid  body  torque  produces  residual  angular  rotations,  the  inverse 
dynamic  torque  does  not  show  residual  angular  rotations.  As  a  matter  of  fact,  it  was  observed  in 
the  simulations  that  the  rigid  body  torques  produced  residual  vibration  in  all  the  nodal 
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Figure  1 2.  Joint  3  inverse  dynamics  ( - )  and  rigid  ( . !  torques 
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deformations  while  the  inverse  dynamics  torques  eliminated  the  residual  oscillation.  Further¬ 
more,  the  inverse  dynamics  torques  produced  nodal  deformations  which  exhibited  non-causal 
characteristics  with  respect  to  the  end-point  motion.  Figure  14  shows  a  comparison  of  the  vertical 
tip  error  at  joint  5  obtained  by  a  feedforward  of  the  inverse  dynamics  torque  (solid  curve)  with  the 
tip  error  resulting  from  a  feedforward  of  the  rigid  body  torque  (dashed  curve).  This  figure  shows 
that  the  inverse  dynamics  torque  provides  an  excellent  tracking  of  the  desired  end  effector 
trajectory  whereas  the  rigid  body  torque  again  induces  substantial  vibration  on  the  end-point 
motion. 

The  numerical  simulations  reported  in  this  paper  were  carried  out  using  a  single-processor  Sun 
Sparcstation  1.  For  the  open-chain  multibody  system,  the  CPU  time  was  14  s  for  the  recursive 
method  and  37  s  for  the  non-recursive  method.  As  expected,  the  recursive  method  is  more  efficient 
than  the  non-recursive  method  for  open-chain  problems  since  the  former  solves  the  system 
equations  one  link  at  a  time,  hence  significantly  reducing  the  dimension  of  the  problem.  However 
the  non-recursive  method  can  easily  be  amended  to  take  advantage  of  parallel  processors,  hence 
enabling  the  method  to  become  more  attractive  when  multiprocessors  are  used. 

For  the  closed-chain  multibody  system  presented  above,  the  non-recursive  method  was  used  to 
solve  the  inverse  dynamics  problem  with  a  CPU  time  of  96  s.  The  use  of  the  recursive  procedure 
for  the  closed-chain  system  would  require  an  ad  hoc  procedure  where  the  analyst  must  decide 
where  to  cut  the  chain  and  impose  proper  boundary  conditions  on  the  resulting  tree-structured 
open-chain  system.  Hence,  the  recursive  method  becomes  too  cumbersome  for  the  inverse 
dynamics  of  closed-chain  systems,  and  for  this  reason  it  was  not  used  to  simulate  the  closed-chain 
system.  ' 


4.  CONCLUSION 

A  non-recursive  Lagrangian  approach  for  the  inverse  dynamics  of  flexible  multibody  systems  has 
been  presented.  The  procedure  is  capable  of  solving  for  the  non-causal  inverse  dynamics  torque 
profiles  of  both  open-chain  and  closed-chain  flexible  multibody  systems  in  a  unified  and 
systematic  manner.  The  method  is  found  to  produce  an  excellent  tracking  of  the  desired  trajectory 
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of  the  end  effector  In  a  future  paper,  we  will  address  the  inverse  dynamics  problem  for  flexible 
multibody  systems  undergoing  motion  in  three  dimensions.  New  problems  arise  in  the  three- 
dimensionai  case,  since  the  actuating  torque  vectors  have  directions  which  are  time-varying  and 
non-linear  functions  of  the  rigid  body  co-ordinates,  in  contrast  to  the  planar  case  where  the 
applied  torque  vectors  have  directions  fixed  perpendicular  to  the  plane  of  the  multibody  system. 
In  addition,  controllability  and  accessability  issues  need  to  be  addressed. 
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A  Lagrangian  Approach  to  the  Non-causal  Inverse  Dynamics  of 
Flexible  Multibody  Systems:  The  Three-Dimensional  Case 

Ragnar  Ledesma 
Eduardo  Bayo 

Department  of  Mechanical  Engineering 
University  of  California 
Santa  Barbara,  CA  93106 

ABSTRACT 

This  paper  addresses  the  problem  of  end-point  trajectory  tracking  in 
flexible  multibody  systems  through  the  use  of  inverse  dynamics.  A  global 
Lagrangian  approach  is  employed  in  formulating  the  system  equations  of 
motion,  and  an  iterative  procedure  is  proposed  to  achieve  end-point  trajec¬ 
tory  tracking  in  three-dimensional,  flexible  multibody  systems.  Each  itera¬ 
tion  involves  firstly,  a  recursive  inverse  kinematics  procedure  wherein 
elastic  displacements  are  determined  in  terms  of  the  rigid  body  coordi¬ 
nates  and  Lagrange  multipliers,  secondly,  an  explicit  computation  of  the 
inverse  dynamic  joint  actuation,  and  thirdly,  a  non-recursive  forward 
dynamic  analysis  wherein  generalized  coordinates  and  Lagrange  multi¬ 
pliers  are  determined  in  terms  of  the  joint  actuation  and  desired  end-point 
coordinates.  In  contrast  with  the  recursive  methods  previously  proposed, 
this  new  method  is  the  most  general  since  it  is  suitable  for  both  open-chain 
and  closed-chain  configurations  of  three-dimensional  multibody  systems. 
The  algorithm  yields  stable,  non-causal  actuating  joint  torques  and  associ¬ 
ated  Lagrange  multipliers  that  account  for  the  constraint  forces  between 
flexible  multibody  compionents. 
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1.  Introduction 

The  effect  of  elastic  deformation  on  the  dynamics  of  multibody  systems  has  been 
vigorously  studied  during  the  past  thirty  years.  In  particular,  the  modeling  of  multibody 
components  as  elastic  beams  has  received  considerable  attention  as  made  evident  in  the 
survey  papers  of  Lowen  and  Jandrastis,  Erdman  and  Sandor,  Modi,  and  more  recently 
by  Lowen  and  Chassapis.^  A  specific  area  of  interest  with  regards  to  flexible  multibody 
systems,  especially  in  the  aerospace  and  robotics  industries,  is  in  controlling  the  motion 
of  a  specified  point  in  the  multibody  system.  In  most  cases,  the  control  objective  is  to 
have  the  end-point  of  the  multibody  system  follow  a  desired  trajectory.  Various  feedback 
control  strategies  for  the  problem  of  end-point  trajectory  tracking  have  been  proposed, 
and  the  survey  papers  of  Balas^  and  Book^  present  some  of  the  approaches  advanced  by 
the  controls  community  towards  this  problem. 

The  problem  of  end-point  trajectory  tracking  in  flexible  multibody  systems  has  led 
to  the  development  of  computational  methods  commonly  referred  to  as  inverse  dynamics. 
Inverse  dynamics  deals  with  the  problem  of  determining  the  joint  actuation  that  will 
cause  a  specified  control  point  in  the  flexible  multibody  system  to  follow  a  desired  trajec¬ 
tory.  The  pioneering  work  of  Reference  7  on  the  trajectory  control  of  a  single  flexible 
link  through  inverse  dynamics  showed  that  the  inverse  dynamic  torque  is  non-causal 
with  respect  to  the  end-point  motion,  i.e.,  actuation  is  required  before  the  end-point  has 

Q 

started  to  move  as  well  as  after  the  end-point  has  stopped.  Moulin  demonstrated  that 
because  of  the  non-minimum  phase  character  of  the  inverse  dynamics  for  the  trajectory 
tracking  problem,  the  only  bounded  solution  for  the  inverse  dynamic  torque  has  to  be 
non-causal.  Bayo,  et.  al.^  extended  the  inverse  dynamics  to  planar,  multiple-link  systems 
using  an  iterative  frequency  domain  approach.  The  recursive  method  proposed  in  that 
study  is  suitable  for  planar  open-chain  systems,  but  required  an  ad  hoc  procedure  for 
planar  closed-chain  systems.  A  time  domain  inverse  dynamics  technique  based  on  the 
non-causal  impulse  response  function  was  presented  by  Bayo  and  Moulin*®  for  the  single 
link  system,  with  provisions  for  extension  to  multiple  link  systems.  An  equivalent  time 
domain  approach  for  a  single  link  arm  was  proposed  by  Kwon  and  Book**  where  the 
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non-causality  of  the  computed  torque  was  captured  by  dividing  the  inverse  system  into 
causal  and  anticausal  parts.  Recently,  a  more  systematic  and  more  general  non-recursive, 
frequency  domain  method  for  the  inverse  dynamics  of  planar  multibody  systems  was 
proposed  in  Reference  12.  This  method  includes  the  constraint  forces  between  the  multi¬ 
body  components  in  the  equations  of  motion,  and  the  method  is  found  suitable  for  both 
open-chain  and  closed-chain  configurations  of  planar  multibody  systems.  The  effect  of 
Coriolis  forces  and  centrifugal  forces  on  the  inverse  dynamics  of  constrained  mechanical 
systems  was  presented  by  Gofron  and  Shabana.^^ 

The  inverse  dynamics  approach  to  end-point  trajectory  tracking  of  open-chain  flexi¬ 
ble  multibody  systems  was  recently  applied  to  the  three-dimensional  problem  by 
Ledesma,  et.  al.}^  where  a  recursive  procedure  was  proposed  to  simultaneously  track  a 
desired  end-point  trajectory  and  minimize  motion-induced  vibrations  through  the  com¬ 
bined  use  of  lumped  inverse  dynamic  torques  and  distributed  piezoelectric  actuators.  The 
recursive  procedure  required  a  controlled  motor  at  each  intermediate  revolute  joint  and 
three  motors  at  the  ground.  This  procedure  is  effective  for  open-chain  systems,  but  it  is 
not  valid  for  closed-chain  systems  because  in  such  systems,  the  number  of  required  con¬ 
trol  inputs  is  less  than  the  number  of  joints. 

In  this  paper,  we  present  a  general  computational  approach  for  the  solution  of  the 
non-causal  inverse  dynamics  of  three-dimensional,  flexible  multibody  systems,  that  is 
suitable  for  both  open-chain  and  closed-chain  configurations.  With  this  work,  we  present 
a  methodology  that  is  suitable  for  all  multibody  systems,  ranging  from  the  single  link 
case  to  three-dimensional  systems  with  general  topologies.  The  equations  of  motion  are 
formulated  in  Section  2  and  an  iterative  algorithm  is  subsequently  developed.  Simulation 
results  for  open-chain  and  closed-chain  configurations  are  presented  in  Section  3  to 
demonstrate  the  validity  and  accuracy  of  the  method. 
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2.  Problem  Formulation 

In  this  section,  we  derive  the  governing  equations  of  motion  for  a  flexible  multibody 
system  by  using  a  global  Lagrangian  approach,  and  develop  a  solution  for  the  inverse 
dynamics  problem  that  is  suitable  for  both  open-chain  and  closed-chain  configurations. 
Crucial  to  the  success  of  the  proposed  procedure  is  the  use  of  the  correct,  non-causal 
Lagrange  multipliers  that  account  for  the  constraint  forces  between  flexible  multibody 
components.  These  Lagrange  multipliers  are  determined  in  a  forward  dynamic  analysis, 
which  in  turn,  require  the  unknown  inverse  dynamic  actuations.  Therefore,  the  proposed 
solution  is  an  iterative  procedure  which  converges  to  the  stable,  non-causal  inverse 
dynamic  actuations  and  the  associated  Lagrange  multipliers. 

Consider  an  n-body  flexible  multibody  system  such  as  that  shown  in  Fig.  1.  A.  typi¬ 
cal  multibody  component,  say  body  i ,  is  shown  in  Fig.  1  along  with  the  floating  reference 
frame  associated  with  that  body.  The  generalized  coordinates  consist  of  rigid  body  coor¬ 
dinates  which  describe  the  position  and  orientation  of  the  floating  reference  frame 
associated  with  each  multibody  component,  and  deformation  coordinates  q/  which 
describe  the  deformation  of  the  flexible  body  with  respect  to  its  floating  reference  frame. 
The  rigid  body  coordinates  consist  of  the  Cartesian  coordinates  R*  which  describe  the 
position  of  the  origin  of  the  floating  reference  frame  associated  with  body  i ,  and  a  set  of 
Euler  parameters  0‘  which  describe  the  orientation  of  the  floating  firame.  The  use  of  Euler 
parameters  among  several  choices  of  orientation  coordinates  will  be  explained  later  in 
the  section  describing  the  inverse  dynamics  solution  procedure.  The  deformation  from 
the  nominal  configuration  is  assumed  to  be  small,  so  that  the  different  bending  and  tor¬ 
sional  modes  are  decoupled.  For  the  sake  of  completeness,  we  summarize  in  the  follow¬ 
ing  equations  the  basic  kinematic  expressions  that  lead  to  the  general  equations  of  motion 
for  flexible  multibody  systems.  A  more  detailed  formulation  is  found  in  Reference  15. 
With  the  above  choice  of  coordinates,  the  position  of  an  arbitrary  point  P  in  body  i  is 
given  by 

r‘  =  R‘  +  A*  u‘ 


(1) 


-5- 


where  R‘  is  the  location  of  the  origin  of  the  body  axes  with  respect  to  the  inertial  frame, 
u‘  is  the  location  of  point  P  with  respect  to  the  body  axes,  and  A‘  is  the  rotation  transfor¬ 
mation  matrix  from  the  body  axes  to  the  inertial  frame.  In  the  three-dimensional  case,  the 


rotation  transformation  matrix  is  given  by 


A*  = 


2(0^-^0?)-l 
2(0102  +  0003) 
2(0103  “  0002) 


2(0102  ~  0003) 
2(0,?  +  0^)-l 
2(0203  +  0O0l) 


2(0103  +  0002) 
2(0203  “  0O0l) 

2(0(?  +  0^)-l 


(2) 


where  the  orientation  coordinates  are  represented  by  four  Euler  parameters  0(i,  0f  ,  0^,  and 
0j  which  satisfy  the  following  identity: 


i  (0i)2=l. 

k=0 


(3) 


The  position  vector  with  respect  to  the  body  axes,  u* ,  can  be  decomposed  into 

U‘  =  Ur‘  +  U/  (4) 

where  u/  is  the  position  vector  of  point  P  in  the  undeformed  state  with  respect  to  the 
body  axes,  and  u/  is  the  deformation  vector  of  point  P  with  respect  to  the  body  axes. 
The  deformation  vector  u/  can  be  expressed  in  terms  of  the  nodal  deformations  by  using 
a  finite  element  discretization  scheme,  hence 

u/  =  N‘  q/  (5) 


where  N*  is  the  shape  function  matrix  and  q/  is  the  nodal  deformation  vector.  Differen¬ 
tiating  Eq.  (1)  with  respect  to  time,  we  obtain  the  following  expression  for  the  velocity 
vector  in  terms  of  the  rigid  body  coordinates  and  nodal  deformation  coordinates: 

P  =  R'  -  2  A‘  u'  E*  0'  f  A*  N‘  q/  (6) 

where  ( ')  represents  differentiation  with  respect  to  time,  E‘  is  a  matrix  that  depends 


linearly  on  the  Euler  parameters  and  is  given  by 


E‘  = 


-01  00  03  “^2  ‘ 

—02  —03  00  01 

-03  02  -01  00 


(7) 


(8) 


and  u‘  is  a  3  X  3  skew-symmetric  matrix  given  by 


u‘ 


0  -M*  Uy 

Uj  0  -Ux 

-Uy  Ux  0 


in  which  Ux,  Uy,  and  Wj  are  the  coordinates  of  the  generic  point  P  with  respect  to  the 
body  axes,  in  the  deformed  configuration. 


Considering  the  reference  coordinates  =  [  R^.S^.q/"  ]  as  generalized  coordinates 
for  the  flexible  multibody  system,  these  coordinates  are  not  independent  because  the 
motion  of  specific  points  in  different  bodies  are  related  according  to  the  type  of  mechani¬ 
cal  joints  that  interconnect  them.  Moreover,  in  flexible  mechanical  systems,  the  defor¬ 
mation  of  a  component  affects  the  configuration  of  adjacent  components.  As  a  conse¬ 
quence,  the  interdependence  of  the  generalized  coordinates  is  expressed  by  a  vector  of 
kinematic  constraint  equations,  such  as 


<I>(q./)  =  0  (9) 

where  q  is  the  total  vector  of  system  generalized  coordinates,  t  is  time,  and  <I>  is  the  vec¬ 
tor  of  linearly  independent  holonomic  constraint  equations.  These  constraint  equations 
can  be  further  classified  into: 

1.  rigid  body  constraints  where  only  rigid  body  variables  are  involved  in  the  constraint 
equation; 

2.  joint  constraints  where  both  rigid  body  and  deformation  coordinates  are  included  in 
the  constraint  equation;  and 

3.  rheonomic  constraints  wherein  the  constraint  equations  can  be  explicit  functions  of 
time  as  well  as  generalized  coordinates. 

The  third  type  of  constraint  becomes  active,  for  example,  in  the  case  of  imposing 
the  coordinates  of  the  end-effector  to  follow  a  desired  trajectory.  To  illustrate  the  con¬ 
struction  of  constraint  equations,  take  the  case  of  a  spherical  joint  which  connects  two 
flexible  bodies  i  and  j  at  points  P  and  Q  shown  in  Fig.  2.  The  three  constraint  equations 
corresponding  to  the  constraint  condition  that  requires  points  P  and  Q  to  be  coincident 
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can  be  written  as 

[r‘  +  A'  u/ij  -  [r^  +  A>  u^]  =0.  (10) 

We  note  that  the  constraint  equation  exemplified  by  Eq.  (10)  forms  a  set  of  coupled  non¬ 
linear  algebraic  equations  in  the  rigid  body  coordinates  and  deformation  coordinates. 

Considering  the  ngid  body  and  deformation  coordinates  described  above  as  general¬ 
ized  coordinates,  and  following  standard  procedures  in  multibody  dynamics,  the  con¬ 
strained  equations  of  motion  become 

M(q)q-»-Cq-hKq  +  <DTX  =  Q,-t-Qv(q.q)  (11) 

where  M,  C  and  K  are  the  system  mass,  damping  and  stiffness  matrices,  respectively,  X 
is  the  vector  of  Lagrange  multipliers  associated  with  the  constraints,  Oq  is  the  constraint 
Jacobian  matrix,  Q,  is  the  vector  of  applied  external  forces,  and  Qv  is  the  quadratic  velo¬ 
city  vector.  The  quadratic  velocity  vector  contains  the  centrifugal  forces  and  Coriolis 
forces  that  result  from  the  differentiation  of  the  kinetic  energy  expression  with  respect  to 
the  generalized  coordinates.  Geometric  stiffening  due  to  high  rotation  rates  can  also  be 
added  to  the  vector  Qy. 

2.1.  Forward  Dynamics 

In  a  forward  dynamic  analysis,  i.e.,  finding  the  resulting  motion  given  the  applied 
joint  forces  and  external  forces,  Eqs.  (9)  and  (11)  constitute  a  mixed  system  of 
differential-algebraic  equations  that  have  to  be  integrated  simultaneously.  As  explained 
in  the  next  section,  the  solution  to  the  inverse  dynamics  problem  requires  a  forward 
dynamic  analysis  within  an  iteration  process.  We  solve  the  forward  dynamics  problem 
by  using  the  augmented  Lagrangian  penalty  formulation.^^  Applying  the  augmented 
Lagrangian  penalty  formulation  to  Eqs.  (9)  and  (1 1)  results  in  the  following  equation: 

M(q)  q  +  Cq  +  Kq  +  O^  +  2p,Q)<i)  +  to2<I>j  =Qe  +  Qv(q,q) 

-  OqT  X* 


(12) 
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where  a  is  a  diagonal  matrix  of  penalty  factors  whose  elements  are  large  real  numbers 
that  will  assure  the  satisfaction  of  constraints,  cd  and  |i.  are  diagonal  matrices  representing 
the  natural  frequencies  and  damping  characteristics  of  the  dynamic  penalty  system  associ¬ 
ated  with  the  constraints.  Values  of  a  in  the  range  10^  <  a  <  10^  provide  excellent 
results  when  working  in  double  precision.  The  augmented  Lagrangian  method  requires 
an  iteration  for  the  correct  value  of  the  Lagrange  multipliers.  The  iterative  equation  for 
the  Lagrange  multipliers  is  given  by 


a  O 


^1*1  =X.*-f-a 


O  . 


(13) 


The  iterative  process  described  by  Eq.  (13)  involves  only  a  few  additional  operations  dur¬ 
ing  each  iteration  but  it  sigruficantly  improves  the  convergence  of  the  forward  dynamics 
solution  as  compared  to  the  standard  penalty  method. 

The  augmented  Lagrangian  penalty  formulation  has  several  advantages  over  the 
standard  algorithms  used  in  solving  differential-algebraic  equations.  First,  the  method 
obviates  the  need  to  solve  a  mixed  set  of  differential-algebraic  equations  and  does  not 
increase  the  number  of  equations  to  account  for  the  constraints.  Second,  this  method 
allows  the  use  of  standard  unconditionally  stable  algorithms  without  the  need  of  further 
stabilization  techniques  to  control  the  violation  of  constraints  during  the  integration  pro¬ 
cess.  Third,  the  method  can  handle  redundant  constraints  and  allows  the  multibody  sys¬ 
tem  to  undergo  singular  positions.  Fourth,  the  constraint  forces  (Lagrange  multipliers) 
can  be  obtained  as  a  by-product  of  the  integration  without  having  to  integrate  additional 
equations  for  them.  Finally,  the  method  assures  convergence  independent  of  the  penalty 
values  used. 


2.2.  Inverse  Kinematics  and  Inverse  Dynamics 

Gofron  and  Shabana^^  have  proposed  a  solution  to  the  inverse  dynamics  problem  by 
integrating  Eq.  (12)  directly  and  solving  for  the  joint  actuation  from  the  Lagrange  multi¬ 
pliers  thus  obtained.  This  method  leads  to  a  causal  solution  which  relies  on  the  presence 
of  damping  forces  in  order  to  obtain  a  stable  solution.  In  contrast,  the  method  presented 
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in  this  paper  does  not  rely  on  damping  to  produce  a  stable  solution.  Moulin  and  Bayo 
recognized  that  because  of  the  non-minimum  phase  character  of  the  inverse  problem,  the 
unique  stable  solution  should  be  non-causal,  i.e.,  actuation  is  required  before  the  end¬ 
point  has  started  to  move  as  well  as  after  the  end-point  has  stopped.  These  findings  have 
been  corroborated  by  Paden  and  Chen^^  in  their  theoretical  work  on  the  inversion  of  non¬ 
linear  non-minimum  phase  systems  such  as  flexible  multibodies.  The  proper  integration 
is  of  crucial  importance  in  obtaining  non-causal  solutions,  and  as  previously  demon¬ 
strated  in  the  planar  inverse  dynamics  problem,  the  time-anticipatory  effect  can  be 
automatically  obtained  by  integrating  in  the  frequency  domain^  or  in  the  time  domain  by 
using  the  non-causal  impulse  response  function  and  the  bilateral  Laplace  transform. 

A  previously  proposed  solution  to  the  three-dimensional  inverse  dynamics  prob- 
lem^'^  relied  on  a  pinned-free  finite  element  model  of  a  flexible  beam,  and  the  equation 
for  the  inverse  dynamics  torque  was  formulated  by  imposing  the  condition  that  the  tor¬ 
sional  deformation  and  the  two  transverse  deformations  of  the  free  end  of  each  link  be 
zero  throughout  the  motion.  This  limited  type  of  model  led  to  a  recursive  scheme  to  solve 
the  inverse  dynamics  of  flexible  multibody  systems,  and  is  found  suitable  for  open-chain 
configurations  but  not  for  closed-chain  configurations. 

In  this  section,  we  describe  an  iterative  Lagrangian  procedure  to  solve  the  three- 
dimensional  inverse  dynamics  problem  for  either  open-chain  or  closed-chain  topologies. 
Our  overall  strategy  is  to  first  solve  the  inverse  kinematics  problem,  i.e.,  finding  the  unk¬ 
nown  rigid  body  coordinates  q^.  and  flexible  body  displacements  ,  given  the  desired 
end-point  coordinates  as  explicit  functions  of  time.  Having  determined  the  correct  gen¬ 
eralized  coordinates  and  their  time  derivatives,  the  inverse  dynamics  joint  torques  can  be 
obtained  explicitly  from  the  equations  of  motion.  Compared  to  the  recursive  procedure 
cited  above,  this  new  approach  is  more  systematic  and  becomes  the  only  choice  when 
closed-chain  systems  are  encountered.  We  model  the  elastic  links  under  pinned-pinned 
boundary  conditions.  Furthermore,  since  torsional  deformations  cause  deviations  from 
the  nominal  configuration  further  down  the  chain,  we  model  the  elastic  link  as  fixed  with 
respect  to  torsion  at  the  distal  end  of  the  link. 
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Our  goal  then  is  to  formulate  an  inverse  kinematics  equation  that  is  linearized  about 
the  nominal  motion,  so  that  the  elastic  displacements,  which  are  non-causal  with  respect 
to  the  end-point  motion,  can  be  determined  through  a  frequency  domain  analysis.  This  is 
possible  only  if  the  leading  matrix  of  the  linearized  equation  is  time-invariant  and  if  the 
forcing  term  is  Fourier  transformable.  This  objective  has  been  achieved  in  the  planar  case 
with  the  use  of  reference  coordinates  for  the  rigid  body  variables  to  describe  the  position 
and  orientation  of  the  floating  reference  frame. 

The  three-dimensional  inverse  kinematics  problem  presents  additional  difficulties 
not  found  in  the  planar  case.  First,  unlike  the  planar  case,  the  three-dimensional  torque 
vectors  change  directions  in  time,  so  that  the  external  force  vector  Qe  in  Eq.  (11) 
becomes  a  nonlinear  function  of  the  rigid  body  orientation  coordinates.  To  overcome  this 
difficulty,  a  proper  parametrization  of  the  rigid  body  coordinates  and  proper  bases  for  the 
joint  torques  are  necessary  to  attain  the  stated  objectives  in  forming  the  linearized  inverse 
kinematics  equations.  As  described  later  in  this  section,  the  desired  form  of  the  linearized 
inverse  kinematics  equation  is  possible  if  Euler  parameters  are  used  to  describe  the  rigid 
body  orientation  and  if  the  base  torque  vector  of  each  multibody  component  is  expressed 
in  terms  of  components  along  the  associated  floating  reference  frame. 

A  second  difficulty  that  appears  in  the  three-dimensional  inverse  dynamics  problem 
is  that  the  end-point  vibration  in  the  lane  defined  by  the  revolute  joint  axis  and  the 
member  axis  can  not  be  controlled  by  the  torque  applied  at  the  revolute  joint.  ITiis  sug¬ 
gests  that  additional  actuation  is  necessary  to  control  the  end-point  motion  when  the  mul- 
tibody  system  reaches  an  "inaccessible"  configuration.  This  problem  has  been 
addressed  in  Reference  14  wherein  one  motor  at  each  intermediate  joint  and  three  motors 
at  the  ground  were  proposed  to  control  the  end-point  motion  for  all  possible 
configurations  of  a  certain  class  of  open-chain,  flexible  multibody  systems.  The  problem 
of  "inaccessibility"  in  open-chain  systems,  however,  can  be  completely  avoided  simply 
through  a  judicious  design  of  the  orientation  of  the  joint  motors  so  that  end-point  vibra¬ 
tion  is  controllable  for  all  possible  configurations.  For  closed-chain  systems,  "inaccessi¬ 
ble"  configurations  do  not  occur,  hence  the  controllability  of  the  end-point  motion  is 
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assured  without  the  need  for  extra  actuation. 


Consider  again  the  system  equations  of  motion  expressed  by  Eq.  (11).  For  a  typical 
multibody  component,  say  body  i ,  the  equations  of  motion  can  be  written  in  the  follow¬ 


ing  partitioned  form: 
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The  elements  of  the  mass  matrix  and  quadratic  velocity  force  vector  corresponding  to  an 
isoparametric,  three-dimensional  curved  beam  finite  element  are  given  in  the  Appendix. 

Let  us  define  x‘  as  the  torque  vector  at  the  base  of  body  i ,  whose  three  components 
tj,  and  x/  are  parallel  to  the  associated  floating  reference  axes  r^,s^,  and  respec¬ 
tively.  If  we  use  Euler  parameters  as  the  rigid  body  orientation  coordinates,  the  externally 
applied  joint  forces  Qje  associated  with  the  rigid  body  rotation  of  body  i  can  be 
expressed  as 


Qie  =  [G‘  F  Cx*  -  [A‘  F  A'-^i  x*-"!; 


(15) 


where  x*  is  the  base  torque  acting  on  body  i  and  whose  components  are  parallel  to  the 
floating  reference  axes  associated  with  body  i;  X'"*’^  is  the  vector  of  joint  torques  and 
reaction  moments  transmitted  from  body  i  to  body  j+1,  and  whose  components  are 
parallel  to  the  floating  reference  axes  associated  with  body  j+1;  A*  and  A*+*  are  body 
axes  to  inertial  axes  rotation  transformation  matrices  for  bodies  i  and  r+l,  respectively; 
and  G‘  is  a  matrix  that  maps  the  derivatives  of  the  Euler  parameters  describing  the  orien¬ 
tation  of  the  reference  frame  of  body  i  to  the  angular  velocity  of  this  reference  frame, 
and  is  given  by  G‘  =  2  E‘  .  Cbmbining  Eq.  (15)  with  the  second  set  of  equations  in  Eq. 
(14)  yields 

[G‘]^  ^x‘  -  [AM^  A‘+i  x‘+*;  =  R‘  +  m^e  0‘  +  q/ 

+  X  —  Qid- 

If  we  pre-multiply  both  sides  of  Eq.  (16)  by  -^G*  and  use  the  identity 


(16) 


-  12- 


|GGJ'=13  (17) 

where  I3  is  the  3  a:  3  identity  matrix,  we  can  get  the  following  result  after  expanding  the 
inertia  matrices  and  quadratic  velocity  vectors  found  in  Eq.  (16),  (the  reader  is  referred  to 
the  Appendix  for  the  expressions  for  the  inertia  matrices  and  quadratic  velocity  vectors  in 
terms  of  the  invariants  of  the  motion): 

x‘  =  [A‘F  -t-  -^G‘  R‘  +  J‘  G‘  9'  +  J/  q/  +  -J-G'  X 

+  -^G*  [G‘  ]7’  G‘  0‘  +  J/  q/;  (18) 

where  J*  is  the  3  x  3  inertia  tensor  of  body  i  with  respect  to  the  origin  of  the  floating 
reference  frame  and  measured  relative  to  this  frame,  and  J/  is  the  inertia  matrix  coupling 
the  rigid  body  rotation  and  the  clastic  deformation.  The  key  to  obtaining  a  time-invariant 
leading  matrix,  that  is  necessary  in  transforming  the  linearized  equations  of  motion  into 
the  frequency  domain,  is  the  fact  that  the' inertial  coupling  matrix  J/  can  be  decomposed 
into  the  sum  of  a  time-invariant  matrix  and  a  time-varying  matrix,  i.e., 

(19) 

where  J/c  and  J/t  are  the  time-invariant  pan  and  time-varying  part  of  J/,  respectively. 
This  decomposition  is  essential  to  the  formulation  of  the  inverse  kinematics  equations 
that  lead  to  non-causal  solutions  to  the  nonlinear  inversion  problem.  This  is  also  the  rea¬ 
son  for  selecting  Euler  parameters  as  rigid  body  orientation  coordinates  over  other  types 
of  singularity-free  coordinates  such  as  natural  coordinates,^*  where  the  decomposition  of 
the  inertial  coupling  matrix  into  time-invanant  and  time-varying  parts  is  multiplicative 
rather  than  additive  as  in  Eq.  (19).  Introducing  this  decomposition  into  Eq.  (18)  results  in 
the  following  expression  for  the  base  torque  on  body  i : 

x‘  =  [A‘ A*'*’*  x'"^*  +  Ti  -t-  J/c  q/ 


(20) 


-  13- 


where  is  a  torque  vector  given  by 

=  |G‘  m^R  R‘  +  J‘  G'  9'  +  J/,  q/  -h  |G‘  <t>$  X 

-h-^G'  [G'f  fJ‘  G‘  0‘^J/q/;.  (21) 

Considering  the  equations  of  motion  associated  with  the  elastic  degrees  of  freedom, 
the  externally  applied  force  vector  due  to  the  joint  torques  acting  on  body  i  can  be 
expressed  as 

Qi/  =  NJ  x'  -  [A‘  (22) 

where  Nj,  and  N,  are  the  shape  function  matrices  associated  with  a  torque  vector  acting 
on  node  b  (base)  and  at  node  t  (tip)  of  the  finite  element  mesh,  respectively.  Combining 
Eq.  (22)  with  the  third  set  of  equations  in  Eq,  (14)  yields  the  following  inverse  kinemat¬ 
ics  equations  for  body  i : 

iTi//  q/  c//  q/  +  k//  q/  =  (23) 

where  the  modified  mass  matrix  is  given  by 

m}f  =  mff  -  NJ’  J/^  (24) 

and  the  motion-induced  force  vector  acting  on  the  elastic  degrees  of  freedom  is  given  by 
F‘  =  ^[A‘  f  A'-"‘  Ti;  -  Nf  [A‘  A*-^‘  x*'+' 

+  QV -^<5  R‘ -m/ee*.  (25) 

The  modified  mass  matrix  is  nonsymmetric  and  it  is  precisely  this  nonsym¬ 
metry  that  produces  elastic  displacements  which  are  non-causal  with  respect  to  the  end¬ 
point  motion  when  non-causal  techniques  are  employed  to  obtain  the  proper  inversion  of 
the  nonlinear,  non-minimum  phase  systems.  Furthermore,  inspection  of  Eqs.(23)-(25) 
shows  that  the  inverse  kinematics  equation  for  body  i  assumes  that  the  base  torque  vector 
x'"*"*  is  known  beforehand.  This  suggests  some  form  of  recursive  algorithm  for  the  inverse 
kinematics,  i.e.,  finding  the  elastic  displacements  starting  from  the  end-point,  and 
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proceeding  to  the  base  of  the  multibody  system  (inboard  direction).  This  procedure  is 
straightforward  for  open-chain  configurations.  However,  for  closed-chain  configurations, 
we  need  to  take  the  additional  step  of  cutting  the  chain  at  the  joint  that  is  defined  as  the 
end-point,  and  then  proceed  as  in  the  open-chain  case,  since  the  constraint  forces  at  the 
cut  are  automatically  accounted  for  by  the  vector  of  Lagrange  multipliers. 


The  nonlinear  inversion  can  now  be  carried  out  efficiently  in  the  frequency  domain 
since  the  leading  matrices  have  been  constructed  such  that  they  remain  constant 
throughout  the  motion.  Our  strategy  is  to  solve  Eq.  (23)  in  the  frequency  domain  to 
obtain  the  nodal  deformation  vector  q/  that  is  non-causal  with  r*,spect  to  the  end-point 
motion.  In  the  frequency  domain,  Eq.  (23)  can  be  written  as  a  set  of  complex  equations 
for  a  particular  frequency  to 


q/(co)  =  F‘(co) 


(26) 


where  q/(to)  is  the  Fourier  transform  of  q/(r)  and  F‘((o)  is  the  Fourier  transform  of 
F‘  (r ).  Eq.  (26)  is  based  on  the  assumption  that  q/(t )  and  F*  (r)  are  Fourier  transformable. 
This  assumption  is  valid  for  slewing  motions  which  are  from  rest  to  rest.  Tne  nodal 

A 

acceleration  vector  q^  (co)  can  be  obtained  directly  from  Eq.  (26)  for  each  frequency  (O. 
The  leading  matrix  of  Eq.  (26)  is  a  complex  regular  matrix  that  is  invertible  for  all  fre¬ 
quencies  except  for  to  =  0.  However,  for  co  =  0,  the  system  undergoes  a  rigid  body 
motion,  and  the  leading  matrix  will  be  determined  only  by  niff  which  is  positive  definite 
and  therefore  invertible.  We  note,  however,  that  the  forcing  vector  on  the  right  hand  side 
of  Eq.  (26)  depends  on  the  elastic  deformations,  velocities  and  accelerations.  Therefore, 
an  iterative  process  is  needed  to  obtain  the  solution  to  the  differential  equations  which 
are  nonlinear  in  q^.  We  start  the  iteration  process  by  assuming  zero  elastic  deformations, 
velocities  and  accelerations  for  the  initial  calculation  of  the  forcing  vector  F*  (t ),  and  use 
a  successive  substitution  scheme  to  converge  to  the  correct  solution.  Finally,  the  elastic 
displacements  and  their  derivatives  in  the  time  domain  may  be  obtained  through  the 
application  of  the  inverse  Fourier  transform,  e.g.. 


Alternately,  the  computation  of  rne  elastic  displacements  and  their  derivatives  in 
each  iteration  can  also  be  carried  out  in  the  time  domain  through  the  use  of  the  non- 
causal  impulse  response  function  and  the  bilateral  Laplace  transform,  e.g., 

q/(r)=  7  (28) 

-oo 

where  hy  (r)  is  the  non-causal  acceleration  response  vector  to  an  impulse  applied  to  the 
degree  of  freedom  and/j(r)  is  the  component  of  the  forcing  term  on  the  right 
hand  side  of  Eq.  (23).  We  note  that  the  integration  from  -oo  to  oo  is  necessary  to  capture 
the  non*causal  effects. 

Once  the  non-causal  elastic  displacements  and  their  derivatives  are  known,  Eq.  (18) 
can  be  used  to  explicitly  compute  the  non-causal  inverse  dynamics  joint  efforts  that  will 
move  the  end  effector  according  to  a  desired  trajectory.  We  note,  however,  that  the  joint 
torques  and  elastic  displacements  given  by  Eqs.  (18)  and  (23),  respectively,  depend  on 
the  Lagrange  multipliers  and  rigid  body  coordinates,  which  in  turn  depend  on  the  elastic 
displacements  and  the  applied  torque.  Moreover,  the  rigid  body  coordinates  ana 
Lagrange  multipliers  are  different  from  their  nominal  values  when  the  components  of  the 
multibody  system  are  flexible.  Therefore,  a  forward  dynamic  analysis  is  required  to 
obtain  an  improved  estimate  of  the  generalized  coordinates  and  Lagrange  multipliers.  In 
order  to  ensure  that  the  iteration  process  converges  to  obtain  the  joint  efforts  that  will 
cause  the  end-effector  to  follow  the  desired  trajectory,  the  forward  dynamics  analysis  is 
carried  out  with  the  additional  constraint  that  the  coordinates  of  the  end-point  follow  the 
desired  trajectory.  These  additional  constraints  have  corresponding  Lagrange  multipliers 
which  act  as  correcting  terms  to  the  joint  efforts  that  have  been  previously  calculated.  We 
also  note  that  this  iteration  process  takes  into  account  the  effect  of  the  nonlinear  coupling 
between  the  rigid  body  coordinates  and  the  deformation  coordinates  in  the  computation 
of  the  joint  efforts.  The  iterative  procedure  is  found  to  be  convergent  for  multibody 
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systems  with  moderately  flexible  members  and  with  moderate  rotation  rates. 

To  summarize,  the  procedure  for  obtaining  the  inverse  dynamics  solution  for  three- 
dimensional,  flexible  multibody  systems  involve  the  following  steps: 

Algorithm: 

1.  Perform  a  rigid  body  inverse  dynamic  analysis  to  obtain  the  nominal 
values  of  the  rigid  body  coordinates  q,  and  Lagrange  multipliers  X. 

2.  Solve  the  inverse  kinematics  equation  in  the  frequency  domain  through 
Eq.  (23)  or  in  the  time  domain  through  Eq.  (28)  to  obtain  the 
time-delayed  elastic  displacements  and  their  time  derivatives. 

3.  Compute  the  inverse  dynamics  joint  efforts  i  using  Eq.  (18). 

4.  Perform  a  forward  dynamic  analysis  using  Eqs.  (12)  and  (13)  to  obtain  new 
values  for  the  generalized  coordinates  and  Lagrange  multipliers. 

5.  Repeat  steps  2  through  4  until  convergence  in  the  inverse  dynamics 
torques  is  achieved. 

It  is  worthwhile  to  compare  the  recursive  procedure  proposed  in  Reference  14  and 
the  algorithm  proposed  in  this  paper.  The  most  important  difference  between  the  two 
methods  is  that  the  former  method  assumes  that  the  dependence  of  rigid  body  coordinates 
on  the  elastic  displacements  are  made  negligible  through  the  action  of  control  forces  so 
that  the  rigid  body  coordinates  take  on  values  corresponding  to  the  nominal  motion.  This 
assumption  is  not  made  in  the  present  method  and  consequently,  the  solution  of  the 
inverse  kinematics  equation  of  Eq.  (23)  would  require  an  iteration  for  the  rigid  body 
coordinates  q^.  as  well  as  the  Lagrange  multipliers  X  that  are  needed  as  inputs  to  the 
inverse  kinematics  equation.  A  consequence  of  the  above  assumption  in  the  previously 
proposed  recursive  procedure  is  that  control  inputs  were  required  at  all  intermediate 
joints  in  th^  rnultibody  system.  This  requirement  is  acceptable  in  open-chain 
configurations,  but  not  practical  in  closed-chain  configurations  because  the  number  of 
system  degrees  of  fireedom  is  less  than  the  number  of  joints  in  a  closed-chain  multibody 
system.  The  present  procedure  takes  advantage  of  this  fact  and  allows  the  analyst  to 
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choose  a  priori  which  joints  in  the  multibody  system  are  the  control  joints.  Therefore,  the 
present  algorithm  is  more  general  and  more  systematic  than  the  previously  proposed  pro¬ 
cedure,  although  it  requires  more  computational  effort. 

3.  Simulation  Results  and  Discussion 

We  present  in  this  section  some  results  of  the  numerical  implementation  of  the  pro¬ 
cedure  discussed  above.  First,  we  apply  the  procedure  proposed  in  this  paper  to  the 
inverse  dynamics  of  a  two-link,  open-chain  flexible  multibody  system  undergoing  motion 
in  three-dimensional  space,  and  compare  the  results  with  those  obtained  by  the  previ¬ 
ously  proposed  recursive  procedure.'"^  Next,  we  present  some  simulation  results  of  the 
application  of  the  present  procedure  to  the  inverse  dynamics  of  a  closed-chain,  flexible 
multibody  system  undergoing  three-dimensional  motion. 

3.1.  Open-Chain  Multibody  System 

The  iterative  procedure  discussed  in  the  preceding  section  is  applied  to  the  three- 
dimensional  open-chain  flexible  manipulator  shown  in  Fig.  3.  The  multibody  system  is 
controlled  by  three  motors  at  the  base  and  one  motor  at  the  intermediate  revolute  joint. 
The  desired  motion  is  to  have  the  end-point  remain  in  the  xi-x-i  plane  with  the  j:2  coordi¬ 
nate  and  ^3  coordinate  of  the  end-point  following  the  trajectories  shown  in  Fig.  4.  Gravi¬ 
tational  forces  are  neglected.  The  two  links  share  the  following  geometric  and  material 
properties: 

Length:  \.0m 

Cross  section  dimensions:  1.0  cm  at  1.0  cm 
Young’s  modulus;  70  GPa 
Shear  modulus:  27  GPa 
Mass  density:  2115  kg Im^ 

■  Tip  mass:  0.1  kg 
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We  perform  two  sets  of  computations  for  the  example  considered:  1)  using  the 
recursive  procedure  proposed  in  Reference  14;  and  2)  using  the  iterative  procedure  pro¬ 
posed  in  this  paper.  In  the  open-chain  case,  each  intermediate  joint  needs  to  be  con¬ 
trolled,  and  we  therefore  expect  very  similar  results  from  both  methods.  Plots  of  inverse 
dynamic  joint  torques  needed  to  track  the  desired  end-point  trajectory  are  shown  in  Figs. 
5a  and  5b.  The  results  obtained  from  the  two  methods  superimpose,  thus  validating  the 
method  proposed  in  this  paper.  Plots  of  the  corresponding  rigid  body  torques  are  also 
shown  in  the  figures  to  illustrate  the  non-causal  nature  of  the  inverse  dynamic  torques.  In 
Figs.  5a-b,  the  dashed  curves  refer  to  the  inverse  dynamic  torques,  while  the  solid  curves 
refer  to  the  rigid  body  torques.  Transverse  deflections  induced  by  the  motion  at  third 
points  in  the  two  links  are  shown  in  Figs.  6-7.  In  these  figures,  the  dashed  curves  are 
transverse  deflections  caused  by  the  inverse  dynamic  torques,  while  the  solid  curves  are 
deflections  caused  by  the  rigid  body  torques.  In  Fig.  6,  one  vibration  mode  with  a  fre¬ 
quency  of  approximately  30  Hz  dominates  the  response,  while  in  Fig.  7,  two  vibration 
modes  at  frequencies  of  approximately  3  A  Hz  and  30  Hz  dominate  the  response.  The 
higher  frequency  coiresponds  to  the  first  bending  mode  of  a  single  link  and  characterizes 
the  so-called  "fast  subsystem"  while  the  lower  frequency  corresponds  to  the  "slow  sub¬ 
system"  formed  by  the  assembly  of  the  multibody  components.  Rayleigh  damping  was 
used  in  the  numerical  simulation,  with  damping  coefficients  of  0.4%  and  3.8% 
corresponding  to  frequencies  of  3.4  Hz  and  30  Hz ,  respectively.  We  observe  that  the 
inverse  dynamic  torques  minimize  the  residual  structural  vibratior .  that  would  otherwise 
be  present  if  rigid  body  torques  were  used  to  actuate  the  flexible  multibody  system. 

3.2.  Closed-Chain  Multibody  System 

Fig.  8  shows  a  closed-chain,  three-dimensional  flexible  multibody  system,  where 
the  selected  control  torques  are  shown  in  the  figure.  Joints  1-4  are  revolute  joints  while 
joint  5  is  a  spherical  joint  The  desired  end-point  (joint  5)  trajectory  is  a  motion  in  the 
xi-x-^  plane  with  the  Xi  coordinate  and  x-i  coordinate  of  the  end-point  following  the  tra- 
jec-tories  shown  in  Fig.  9.  As  in  the  open-chain  case,  gravitational  forces  are  not 
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considered  in  order  to  focus  on  the  inertial  effects  on  the  dynamics  of  the  system.  The 
four  links  share  the  following  geometric  and  material  properties: 

Length:  1.0  m 

Cress  sect’on  dimensions:  1.0  cm  x  1.0  cm 
Young’s  modulus:  40  GPa 
Shear  modulus:  15  GPa 
Mass  density:  2715  kglm^ 

Tip  mass:  0. 1  kg 

The  present  procedure  is  applied  to  the  closed-chain  system  by  introducing  a  cut  at 
the  end-point  (joint  5),  thus  creating  two  open-chain  systems.  The  internal  constraint 
forces  exposed  by  the  cut  are  automatically  taken  into  account  by  the  Lagrange  multi¬ 
pliers  in  the  equations  of  motion.  Figs.  10a  and  lOb  show  joint  torques  Ti  and  T^,  respec¬ 
tively,  that  are  needed  to  achieve  the  desired  end-point  trajectory.  In  these  figures,  the 
dashed  curves  refer  to  the  inverse  dynamic  torques  obtained  by  the  present  procedure, 
while  the  solid  curves  refer  to  the  corresponding  rigid  body  torques.  Figs.  11  and  12 
show  the  transverse  deflections  at  a  third  point  in  link  #2,  obtained  from  a  feedforward  of 
the  inverse  dynamic  torques  (dashed  curve)  and  the  corresponding  deflection  obtained 
from  a  feedforward  of  the  rigid  body  torque  (solid  curve).  Again,  we  observe  that  the 
inverse  dynamic  torques  minimize  the  residual  structural  vibrations  that  are  otherwise 
present  when  rigid  body  torques  are  used  to  actuate  the  system.  It  is  also  interesting  to 
look  at  the  Lagrange  multipliers  that  represent  the  reaction  forces  between  multibody 
components.  Fig.  13  shows  a  typical  Lagrange  multiplier  associated  with  the  inverse 
dynamic  torques  (dashed  curve)  and  the  corresponding  nominal  Lagrange  multiplier 
associated  with  the  rigid  multibody  system  (solid  curve).  We  observe  that  pre-actuation 
and  post- actuation  are  also  exhibited  by  the  Lagrange  multipliers  in  an  inverse  dynamics 
calculation. 
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4.  Conclusion 

We  have  presented  a  new  iterative  procedure  for  determining  the  inverse  dynamic 
torques  that  are  needed  for  end-point  trajectory  tracking  in  three-dimensional  flexible 
multibody  systems.  An  iterative  procedure  is  necessary  because  of  the  interdependence 
between  the  elastic  coordinates,  the  rigid  body  coordinates  and  the  associated  Lagrange 
multipliers  in  the  system  equations  of  motion.  This  procedure  is  valid  for  both  open- 
chain  and  closed-chain  configurations,  and  differs  fi-om  the  previously  proposed  recursive 
procedures  in  the  sense  that  the  rigid  body  coordinates  are  not  assumed  to  follow  the 
nominal  motion.  The  conditions  for  trajectory  tracking  arc  now  met  in  a  more  general 
way  through  the  satisfaction  of  rheonomic  constraint  conditions.  The  new  method  is 
shown  to  yield  the  same  results  as  those  obtained  with  the  recursive  procedures  for 
open-chain  systems  with  normal  link  flexibilities  and  normal  rotation  rates.  For  closed- 
chain  systems,  however,  this  new  method  is  the  only  valid  procedure  for  determining  the 
inverse  dynamic  torques  since  in  this  case,  the  number  of  control  torques  is  smaller  than 
the  number  of  joints  and  therefore,  the  recursive  methods  can  not  be  applied. 

Further  research  is  needed  to  address  the  inverse  dynamics  problem  wherein  the 
contribution  of  the  quadratic  force  vector  Qv/  to  the  generalized  elastic  forces  is  consid¬ 
erable  enough  to  yield  errors  in  the  feedforward  control  law.  This  case  arises  when  the 
rotation  rates  are  high  or  when  the  structural  components  are  extremely  compliant.  It  has 
been  reponed  that  this  problem  can  be  mitigated  by  introducing  damping  into  the  struc¬ 
ture.^^  An  issue  that  still  needs  to  be  resolved  is  how  to  introduce  distributed  actuation 
that  may  necessary  beyond  that  provided  by  structural  damping  that  is  inherent  in  the 
material.  In  a  separate  paper,  we  have  addressed  this  issue  through  the  use  of  electros- 
trictive  actuators  that  are  distributed  along  the  span  of  the  structure. 
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Appendix 

The  elements  of  the  mass  matrix  and  quadratic  velocity  force  vector  in  Eq.  (14)  can 
oe  expressed  in  terms  of  the  so-called  invariants  of  the  motion  which  need  to  be  com¬ 
puted  only  once  at  the  start  of  the  simulation.  For  each  component  of  the  flexible  multi¬ 
body  system,  the  invariants  of  the  motion  can  be  expressed  by  the  following  integrals: 


Zi  =  |ur  p  dV 
Z2  =  j^Npdl/ 


(Ur\+Ur\)  -Ur^Ur,  -  Ur^Ur, 

Z3  =  f  -  (Ur]  +  Ur])  “  Urji^ 

-Ur,Ur^  -  Ur,Ur,  {Ur]  + 'ir\) 

ZiJ  - 1  Ur,  Nj  pdV,  i , ;  =  1 ,2,3 
=  pdV,  /,;  =  1,2,3 


pdV 


(A.l) 

(A.2) 

(A.3) 

(A.4) 

(A.5) 


where  p  is  the  mass  density,  V  is  the  volume  of  the  component,  N  is  the  shape  function 
matrix,  and  Ny  is  the  j‘^  row  of  the  shape  function  matrix.  We  observe  that  the  motion 
invariant  Zj  is  a  measure  of  the  first  moment  cf  the  undeformed  component  about  the 
body  axes,  and  the  motion  invariant  Z3  is  the  inertia  tensor  of  the  undeformed  component 
with  respect  to  the  body  axes. 

Closed-form  expressions  for  the  motion  invariants  corresponding  to  the  three- 
dimensional,  Bemoulli-Euler  straight  beam  element  are  given  in  Reference  15.  In  this 
paper,  however,  we  use  the  variable-node,  isoparametric,  three-dimensional  curved  beam 
element  developed  by  Bathe  and  Bolourchi^^  to  model  the  flexible  links.  As  a  result,  the 
motion  invariants  can  be  expressed  in  terms  of  integrals  which  are  evaluated  numerically 
through  Gaussian  quadrature. 

The  components  of  the  mass  matrix,  expressed  in  terms  of  the  invariants  of  the 
motion  are  given  by  the  following: 
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=m  I3 

(A.6) 

m/je  =  -  A  S  G 

(A.7) 

niRf  =  A  Z2 

lA.8) 

m00  =  G2^JG;  J  =  [Z3  +  Ji+J2] 

(A.9) 

mQf=G^3f  ',  J/=[J/c+J/f] 

(A.  10) 

in//=Z^UZ^2  +  z^3 

(A.ll) 

where,  in  Eq.  (A.6),  m  is  the  total  mass  of  the  component,  and  the  tilde  symbol  above  the 
vector  in  Eq.  (A. 7)  refers  to  the  skew- symmetric  matrix  operator.  The  matrices  S,  Jfc, 
Jft.Ju  and  J2  are  given  by 


S  =  Zi  Z2  q/ 

J/c  ~ 


J/r 


Jl  = 


ZP-Zi2 

Z^i  -  zp 
zp-zp 


q/(Z^-Z^2) 

q'/(Z^i-Z^3) 

qfiZP-ZP) 


(P22+P33)  -;^i2  ~Pi3 

~P2l  (P1I+P33)  ~P23 

-P2l  -P32  (PII+P22) 


(A.12) 

(A.l.^) 

(A.  14) 

(A.15) 


J2  = 


(<722 +  <733)  -‘712  ~<7i3 

-<721  (^11 -*-^33)  -<723 

-^31  -<732  (^11 +<722) 


(A.16) 


in  which 


Pij=[Z^+Zr]qf,  iJ  =  lX3 


(A.  17) 


and 


<7/>=q/z^q/.  /,;■  =  1,2,3. 


(A.18) 


(A.  19) 


The  quadratic  velocity  force  vectors  are  given  by 
QvR  =  -  a  /'co^  S  +  2  to  Z2  q/j> 

Qv0  =  ~  2  fj  CO  +  J/  (If)  (A. 20) 

Qv/  =  -  [Zi*  +  2^2  +  zPf  -  [0)2]*  mff  q;  -  [2  co]*  m/f  q/  (A.21) 

where  O)  is  the  absolute  angular  velocity  of  the  body  axes,  whose  components  ar**, 

expressed  with  respect  to  the  body  axes  and  given  by 

CO  =  G  0  (A.22) 

and  the  matrices  [co^]*  and  [2  co]*  are  block  diagor  J  matrices  whose  diagonal  elements 

are  co^  and  2  co,  respectively. 
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Fig.  8:  A  three-dimensional,  flexible  closed-chain  system 

Fig.  9:  Nominal  end-point  coordinates  for  the  closed-chain  system 

Fig.  10;  Rigid  (solid)  vs.  flexible  (dashed)  torques:  (a)  joint  torque  T2;  (b)  joint  torque 

Fig.  11:  Link  #2  transverse  deflection  along  s-axis:  rigid  (solid)  vs.  flexible  (dashed) 
Fig.  12;  Link  #2  transverse  deflection  along  t-axis;  rigid  (solid)  vs.  flexible  (dashed) 
Fig.  13:  Comparison  of  Lagrange  multipliers:  nominal  (solid)  vs.  flexible  (dashed) 
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ABSTRACT 

This  paper  addresses  the  problem  of  inverse  dynamics  for  three- 
dimensional  flexible  manipulators  with  both  lumped  and  distributed  acnia- 
tors.  A  recursive  procedure  is  presented  for  computing  the  lumped  inverse 
dynamic  torques  and  the  distributed  piezoelectric  actuator  inputs  for 
simultaneously  tracking  a  prescribed  end-point  trajectory  and  reducing 
induced  vibrations  in  the  manipulator.  The  procedure  sequentially  solves 
for  the  non-causal  inverse  dynamic  torques  and  piezoelectric  voltages 
applied  to  each  link  in  the  manipulator,  starting  from  the  last  element  in 
the  chain  and  proceeding  to  the  base  element.  The  method  allows  trajec- 
toiy  tracking  wherein  controllability  of  the  structural  vibrations  is  assured 
in  all  possible  configurations  through  the  use  of  only  one  motor  at  each 
intermediate  joint  and  three  motors  at  the  ground.  Numerical  simulation 
shows  that  the  elastic  vibrations  can  be  reduced  significantly  through  the 
use  of  distributed  actuators  while  at  the  same  time  satisfying  the  trajectory 
tracking  requirement  through  the  use  of  inverse  dynamics. 
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1.  Introduction 

The  control  of  flexible  manipulators  is  becoming  a  more  important  area  of  research 
as  more  stringent  demands  are  placed  on  these  multibody  systems.  For  example,  current 
developments  in  orbiting  space  manipulators  and  cranes  require  their  different  com¬ 
ponents  to  be  positioned  accurately  in  order  to  fulfill  mission  requirements.  In  most 
cases,  the  control  objectives  for  these  structures  are  end-point  tracking  in  a  slewing 
maneuver  and  minimizing  the  structural  vibrations  that  result  from  the  slewing  motion. 
Several  approaches  towards  these  objectives  have  been  suggested  from  different  seg¬ 
ments  of  the  scientific  community.  Some  researchers  have  suggested  the  improvement  of 
the  dynamic  properties  of  the  stmeture  through  the  use  of  composite  materials  tailored 
for  higher  damping  capabilities  along  with  shape  optimization  to  maximize  the  stiffness 
to  mass  ratio.  ^  A  second  approach  is  to  implement  feedback  control  on  the  joint  actuators 
through  the  use  of  a  variety  of  control  strategies  (the  reader  is  referred  to  Reference  2  for 
a  survey  of  the  state  of  the  art  in  the  control  of  flexible  manipulators).  Recent  studies  in 
this  area  which  include  valuable  experimental  results  have  been  reported  in  References 
345  and  6.  Thirdly,  the  use  of  distributed  active  control  memben  such  as  piezoceramics 
to  damp  out  the  elastic  vibrations  has  been  proposed^  Finally,  a  fourth  method  is  to  com¬ 
pute  the  inverse  dynamic  joint  torques  that  will  cause  the  control  point  to  follow  the 
desired  trajectory.®^*®  The  computed  torque  technique  has  been  validated  with  experi¬ 
mental  results  in  the  study  done  by  Paden,  et.  al.}^  wherein  passive  feedback  and  feed¬ 
forward  of  the  inverse  dynamic  torques  were  used  to  achieve  an  exponentially  stable 
tracking  control  law  in  flexible,  multi-link  systems.  However,  it  is  desirable  to  not  only 
track  a  desired  trajectory  but  also  minimize  the  subsequent  el^^tic  deformations  in  the 
structure.  To  achieve  this  goal,  we  combine  the  third  and  fourth  approaches  and  present 
in  this  paper  a  new  scheme  for  simultaneous  trajectory  tracking  and  vibration  minimiza¬ 
tion  for  open-chain  flexible  articulated  structures.  The  major  contributions  of  this  paper 
are  the  extension  of  the  inverse  dynamics  formulation  to  three-dimensional  open-chain 
manipulators  and  the  combined  application  of  inverse  dynamics  and  distributed 
piezoelectric  actuation  to  track  trajectories  and  reduce  vibrations  simultaneously. 

Trajectory  tracking  in  planar  flexible  articulated  struemres  has  been  addressed  by 
Bayo,®  Bayo  and  Moulin,^  and  also  by  Kwon  and  Book.*®  The  researchers  cited  above 
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have  considered  the  non-minimum  phase  character  of  the  system  when  solving  for  the 
inverse  dynamic  torques  that  are  required  for  end-point  trajectory  tracking  in  slewing 
motions  that  are  confined  in  a  plane.  The  results  of  the  above  studies  lead  to  the  conclu¬ 
sion  that  the  unique  stable  solution  for  the  inverse  dynamic  torques  are  non-causal  with 
respect  to  the  end-point  motion.  This  non-causality  can  be  captured  either  through 

o 

integration  in  the  frequency  domain,  or  by  computing  for  the  inverse  transfer  function  in 
the  frequency  domain  and  subsequently  integrating  in  the  time  domain  through  the  use  of 
bilateral  convolution  integrals,^  or  by  dividing  the  solution  into  causal  and  anticausal 
parts  and  integrating  in  the  time  domain.^®  In  our  opinion,  the  firequency  domain 
approach  involves  a  simpler  formulation  than  the  time  domain  approach,  when  the  equa¬ 
tions  of  motion  can  be  linearized  about  the  nominal  motion  so  that  the  Fourier  transforms 
of  the  deformation  coordinates  and  base  torques  can  be  explicitly  expressed.  Bayo,  et. 
al.}^  extended  the  computed  torque  technique  to  multiple  flexible  links  in  a  planar, 
open-chain  configuration  by  using  a  recursive  method  that  computes  for  the  inverse 
dynamic  torques  starting  from  the  last  component  in  the  chain  and  continuing  to  the  first 
link  at  the  base  of  the  chain.  The  recursive  method  entails  an  inverse  dynamic  analysis 
for  the  non-causal  joint  torques  and  a  forward  dynamic  analysis  for  the  reaction  forces 
between  contiguous  links  in  the  chain.  The  inverse  dynamic  torques  so  determined 
achieve  end-point  tracking  but  do  not  minimize  the  elastic  vibrations.  In  this  paper,  we 
extend  the  application  of  inverse  dynamics  to  simultaneous  trajectory  tracking  and  vibra¬ 
tion  reduction  by  combining  inverse  dynamic  joint  torques  with  distributed  piezoelectric 
actuators  in  three-dimensional,  open-chain  flexible  manipulators.  In  the  present  work,  the 
inverse  dynamic  torques  and  the  piezoelectric  voltages  are  simultaneously  computed  so 
that  the  non-causal  joint  torques  assure  end-point  tracking  while  the  distributed 
piezoelectric  actuators  reduce  the  elastic  vibrations  that  are  induced  by  the  slewing 
motion. 

We  develop  in  the  present  study  a  recursive  computational  algorithm  that  allows 
end-point  trajectory  tracking  of  three-dimensional  open-chain  manipulators  through  the 
use  of  only  one  motor  at  each  intermediate  joint  and  three  motors  at  the  ground,  as 
opposed  to  the  method  proposed  in  Reference  13  where  three  motors  were  proposed  at 
each  joint.  Our  method  achieves  the  trajectory  tracking  requirement  with  the  use  of 
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reduccd  number  of  actuators  and  in  addition,  the  elastic  vibrations  are  reduced  through 
the  use  of  distributed  piezoelectric  actuators.  The  extension  of  the  frequency  domain 
recursive  inverse  dynamics  procedure*^  to  three-dimensional  flexible  manipulators  is  not 
trivial  and  presents  two  main  difficulties.  First,  unlike  the  two-dimensional  case,  the 
three-dimensional  joint  torque  vCwU)rs  charge  direction  in  time  and  become  functions  of 
the  rigid  body  orientation  coordinates.  Consequently,  if  the  torque  vectors  are  expressed 
in  terms  of  components  along  the  inertial  frame  axes,  the  force  vectors  due  to  these 
torques  become  nonlinear  functions  of  rigid  body  orientation  coordinates  and  torque 
components,  for  which  the  Fourier  transforms  may  not  exist  Second,  in  the  three- 
dimensional  case,  the  end-point  vibration  in  the  plane  defined  by  the  revolute  joint  axis  at 
the  base  of  the  link  and  the  member  axis  can  not  be  controlled  by  the  torque  applied  at 
the  revolute  joint 

The  two  previously  mentioned  problems  are  solved  by  taking  the  following  steps  in 
the  formulation  of  the  solution.  First  the  nonlinearity  in  the  forcing  term  due  to  the  joint 
torques  is  avoided  by  expressing  the  base  torque  in  terms  of  components  along  the  float¬ 
ing  reference  frame  associated  with  each  multibody  component  The  base  torques  are 
then  expressed  independent  of  the  rigid  body  configuration,  and  as  a  consequence  these 
torques  are  Fourier  transformable.  Secondly,  having  expressed  the  base  torques  in  this 
manner,  the  bi-axial  bending  and  torsional  deformations  are  decoupled  when  deforma¬ 
tions  from  the  nominal  motion  are  small.  Hence,  one  of  the  base  torque  components 
described  above,  which  refen  to  the  joint  torque  applied  to  the  revolute  joint  controls  the 
end-point  deformation  in  one  direction,  while  the  remaining  two  torque  components  can 
be  treated  as  a  reaction  tonional  moment  which  controls  the  end-point  tonional  deforma¬ 
tion  and  a  reaction  bending  moment  which  controls  the  lateral  deformation  in  the  other 
direction,  respectively.  These  reaction  moments  are  treated  as  unknowns  which  are  to  be 
determined  along  with  the  inverse  dynamic  joint  torques.  Here,  it  is  important  to  note 
that  in  the  three-dimensional  case,  the  torsional  deformation  at  the  end-point  needs  to  be 
controlled,  since  the  torsional  deformation  will  result  in  a  displacement  firom  the  nominal 
configuration  further  down  the  chain. 

The  recursive  procedure  can  then  be  formulated  as  follows.  The  last  multibody 
component  at  the  end  of  the  chain  is  analyzed  first  to  determine  the  actuating  torque  and 
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reaction  moments  at  the  base  of  that  muiubody  component.  Each  of  the  other  com¬ 
ponents  are  subsequently  analyzed  for  the  base  torques  at  the  proximal  end  given  the 
desired  reaction  forces  and  reaction  moments  at  the  distal  end.  The  actuating  torques  at 
the  base  of  the  chain  can  then  be  determined  by  a  simple  projection  of  moments,  where 
the  projection  matrix  depends  on  the  nominal  rigid  body  orientation  coordinates. 
Through  die  use  of  the  proposed  recursive  procedure,  the  end-point  trajectory  tracking 
problem  of  three-dimensional,  open-chain  flexible  manipulators  can  be  accomplished  by 
using  only  one  motor  at  each  intermediate  joint  and  three  motors  at  the  ground.  In  con¬ 
trast  with  other  proposed  approaches  that  rely  on  three  motors  at  each  intermediate 
joint,^^  the  use  of  only  one  motor  at  each  intermediate  joint  for  control  purposes  minim¬ 
izes  the  inertial  forces  acting  on  the  system,  thereby  increasing  the  speed  under  which  the 
system  can  effectively  operate. 

2.  Mathematical  Formulation 

In  this  section,  we  formulate  the  problem  and  present  a  solution  of  simultaneous  tra¬ 
jectory  tracking  and  vibration  reduction  for  three-dimensional  open-chain  flexible  mani¬ 
pulators.  The  problem  formulation  starts  with  the  equations  of  motion  for  each  com¬ 
ponent  of  an  n-link  manipulator  that  is  'mdergoing  motion  in  three  dimensional  space. 
The  combined  objectives  of  trajectory  tracking  and  vibration  reduction  are  Lhea 
expressed  in  the  form  of  a  minimization  problem  that  is  suitable  for  numerical  computa¬ 
tion.  The  solution  procedure  presented  in  this  section  is  a  recursive  procedure  which 
solves  for  the  required  actuating  torques  and  piezoelectric  voltages  starting  from  the  last 
multibody  component  at  the  end  of  the  chain  and  proceeding  to  the  first  component  at  the 
base  of  the  chain. 

2.1.  Problem  Formulation 

Consider  an  n-link,  open-chain  flexible  manipulator  shown  in  Fig.  1.  The  recursive 
procedure  developed  in  this  paper  consists  in  analyzing  each  multibody  component  for 
the  piezoelectric  voltages  applied  across  the  distributed  actuators  and  the  base  torques  at 
the  proximal  end,  given  the  inertial  forces  due  to  the  nominal  motion  and  the  reactions 
coming  from  the  next  component  at  the  other  end.  A  typical  multibody  component,  say 
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bcxly  i ,  is  shown  in  Fig.  1  along  with  the  floating  reference  frame  associated  with  that 
body.  The  generalized  coordinates  consists  of  rigid  body  coordinates  which  describe 
the  position  and  orientation  of  the  floating  reference  frame  associated  with  each  multi¬ 
body  component,  and  deformation  coordinates  q/  which  describe  the  deformation  of  the 
flexible  body  with  respect  to  its  floating  reference  frame.  The  deformation  from  the  nom¬ 
inal  configuration  is  assumed  to  be  small,  so  that  the  different  bending  and  torsional 
modes  are  decoupled.  Using  the  aforementioned  coordinates,  the  equations  of  motion  for 
a  flexible  multibody  component  can  be  written  as^^ 

nifr  ^ff  .^4.00  ^+00  Q^r  ,  Qvr  /i'. 

my,  m//  qy  0  Cyy  qy  0  kyy  qy  “  Q^y  Qvy  (  ^ 

where  m„  is  a  configuration-dependent  matrix  representing  the  mass  and  inertia  tensor 
of  the  deformed  body,  associated  with  the  rigid  body  coordinates;  my,  and  m^  are 
configuration-dependent  matrices  representing  the  inertial  coupling  between  the  rigid 
body  coordinates  and  deformation  coordinates;  and  myy,  cyy,  and  kyy  are  the  consistent 
finite  element  mass,  damping,  and  stiffness  matrices,  respectively.  The  force  vector  Q* 
represents  the  applied  external  forces,  control  forces,  and  reaction  forces  coming  from 
adjacent  multibody  components,  while  the  force  vector  Qy  represents  the  quadratic  velo¬ 
city  force  vector  which  includes  centrifugal  forces  and  Coriolis  forces.  Geometric  stiffen¬ 
ing  due  to  high  rotation  rates  can  also  be  added  to  the  vector  Qyy . 

The  dependence  of  the  rigid  body  coordinates  on  the  deformation  coordinates  can 
be  mitigated  by  assuming  that  we  can  find  control  forces  so  that  the  rigid  body  coordi¬ 
nates  follow  the  nominal  motion  and  therefore,  the  resulting  equations  of  motion  become 

"*//  V  ®//  V  **//  V  ~  Q'’/  ~  '"A  ^ 

where  q,  takes  values  along  the  nominal  motion.  Furthermore,  the  external  force  vector 
Q^f  can  be  decomposed  into  applied  torques,  equivalent  moments  coming  from  the  dis¬ 
tributed  piezoelectric  actuators  and  reaction  forces  coming  from  adjacent  multibody 
components,  i.e., 

Q^y  Vp -t-R  (3) 

where  B^t  and  Bp  are  constant  matrices  that  describe  the  placement  of  the  motors  and 
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piezoelcctric  actuators,  respectively;  x  is  the  vector  of  base  torques  measured  with 
respect  to  the  floating  reference  frame;  is  the  vector  of  applied  piezoelectric  voltages; 
and  R  is  the  vector  of  reaction  forces  coming  from  adjacent  multibody  components.  The 
reaction  force  vector  R  consists  of  three  force  components  and  three  torque  components 
acting  at  both  ends  of  the  muitibody  component.  The  force  and  torque  components  are 
taken  along  the  body  axes  associated  with  the  multibody.  As  shown  in  Fig.  2,  these  force 
and  torque  components  are  transferred  from  body  j  to  body  i  through  the  rotation 
transformation 

&  =  A‘>  d  (4) 

and 

x‘  =  A‘>  V  (5) 

where  &  and  x*  are  the  reaction  forces  and  reaction  moments,  respectively,  measured 
along  the  body  axes  of  body  i ,  and  A‘^  is  the  rotation  transformation  matrix  which  pro¬ 
jects  forces  and  moments  from  body  j  to  body  i .  The  transformation  matrix  A'-'  depends 
on  the  nominal  values  of  rigid  body  coordinates  q/  and  q/. 

Following  the  model  proposed  by  Crawley  and  Anderson,*^  the  piezoelectric  actua¬ 
tion  can  be  considered  as  two  self-equilibrating  concentrated  moments  acting  at  the  two 
ends  of  the  actuator.  The  magnitude  of  the  concentrated  moment  is  proportional  to  the 
voltage  applied  across  the  piezoelectric  acmator,  hence 

(6) 

where  the  proportionality  constant  kp  is  a  function  of  the  dimensions  and  material  pro¬ 
perties  of  the  piezoceramic  material  and  the  link  components.  As  an  example,  for  a  3- 
element  model  where  the  middle  element  has  piezoelectric  actuators  attached  on  four 
sides  to  control  bending  in  two  directions,  the  influence  matrix  Bp  can  be  expressed  as 
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where  the  rows  having  the  non-zero  coefficients  correspond  to  the  rotational  degree  of 
freedom  of  the  element  in  which  the  piezoelectric  actuators  are  attached.  Each  column  of 
corresponds  with  a  specific  voltage  of  a  pair  of  actuators.  Hence,  Bp  will  have  as 
many  columns  as  the  number  of  pairs  of  piezoelectric  actuators.  In  general,  the  influence 
matrix  Bp  depend.^  n  the  sizing  and  placement  of  the  distributed  actuators.  Issues  con¬ 
cerning  the  design  of  so-caUed  smart  structures  are  discussed  in  Reference  16.  In  the 
cited  work,  the  researchers  conclude  that  the  optimal  placement  and  sizing  of  piezoeletric 
actuators  depends  on  the  free  vibradon  modes  of  the  flexible  manipulator,  the  frequency 
content  of  the  desired  motion,  and  the  choice  of  vibration  modes  that  need  to  be  con¬ 
trolled.  As  a  general  rule,  distributed  actuators  are  most  effective  in  controlling  the  vibra¬ 
tion  modes  which  do  not  have  nodes  near  the  distributed  actuator. 

There  are  two  important  reasons  for  expressing  the  base  torques  in  terms  of  com¬ 
ponents  along  the  floating  reference  frame.  First,  the  influence  matrix  becomes  a  con¬ 
stant  Boolean  matrix  because  each  of  the  base  torque  components  is  associated  with  a 
specific  rotational  deformation  degree  of  freedom.  Constant  influence  matrices  B-i  and  B^ 
are  necessary  in  order  to  obtain  the  non-causal  base  torques  and  piezoelectric  voltages  in 
the  frequency  domain  asvwill  be  seen  in  the  following  section.  Second,  the  base  torque 
components  which  are  associated  with  the  torsional  monMnt  and  the  two  bending 
moments  are  independent  of  each  other  if  the  coiresponding  modes  of  deformation  are 
decoupled,  as  in  the  case  of  small  deformation  from  the  nominal  configuration.  Hence, 
the  influence  matrix  B-j  has  independent  columns,  and  this  property  is  useful  in  finding 
the  solution  to  the  minimization  problem  described  in  the  next  section. 


-9- 


Combining  Eqs.  (2)  and  (3),  the  equations  of  motion  for  each  multibody  component 
become 

*"//  9/  +  c//  q/  +  ky/  q/  =  t  +  Bp  Vp  +  F  (8) 

where 

F  =  R  +  Qv/ q,.  (9) 

Having  derived  the  equations  of  motion  for  each  multibody  component,  the  problem 
statement  that  this  paper  addresses  can  be  stated  as  follows.  Given  the  nominal  motion  of 
an  open-chair,  flexible  manipulator,  we  wish  to  find:  1)  the  inverse  dynamic  torques  that 
will  cause  the  end-point  of  each  multibody  component  to  follow  the  nominal  motion;  and 
2)  the  piezoelectric  voltages  that  will  minimize  the  induced  elastic  vibrations  during  the 
motion.  The  problem  of  simultaneous  trajectory  tracking  and  vibration  reduction  can  be 
stated  mathematically  as  the  following  minimization  problem: 

where  f  is  the  set  of  all  pairs  of  stable  joint  torques  and  distributed  actuator  voltages  that 
cause  the  end-point  to  follow  the  nominal  motion,  andy(T,Vp)  is  a  measure  of  the  elastic 
vibrations  and  defined  as 

y('C,Vp)=  J  q^(r)^q/(r)dr.  (11) 


2.2.  Solution  Procedure 

The  minimization  problem  described  in  the  previous  subsection  presents  some 
unique  features  that  aro  associated  with  non-minimum  phase  systems  such  as  flexible 
manipulators.  The  requirement  that  the  pair  (T,Vp )  should  cause  the  end-point  to  follow 
the  nominal  trajectory  admits  only  non-causal  solutions  as  stable  solutions  to  the  minimi¬ 
zation  problem. As  demonstrated  by  Bayo,  et.  the  recursive  frequency  domain 
approach  can  be  employed  to  capture  the  non-causal  (time-anticipatory)  nature  of  the 
actuating  torques  in  the  case  of  multi-link,  planar  manipulators.  In  our  approach,  each 
multibody  component  is  modeled  by  a  pinned-free  beam  and  the  requirement  that  the 
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end-point  follows  the  desired  trajectory  is  satisfied  by  imposing  the  condition  that  the 
elastic  displacement  at  the  tip  is  zero.  In  the  three-dimensional  case,  however,  we  have  to 
impose  the  additional  constraint  that  the  elastic  torsional  deformation  at  the  tip  is  zero, 
because  torsional  deformations  would  cause  displacements  from  the  nominal 
configuration  funher  down  the  chain. 

In  the  frequency  domain,  Eq.  (8)  can  be  expressed  as  a  set  of  complex  equations  for 
a  particular  frequency  cu 

^ 

where  q^(o)),  -cloj),  Vp(o)),  and  F(co)  are  the  Fourier  transforms  of  q/(r),  T(f ),  (r),  and 

F(f),  respectively.  Eq.  (12)  is  based  on  the  assumption  that  the  aforementioned  Fourier 
transforms  exist,  an  assumption  which  is  valid  for  rest  to  rest  slewing  motions.  We  also 
take  note  of  the  fact  that  the  Fourier  transforms  of  the  base  torques  and  applied 
piezoelectric  voltages  can  be  explicitly  expressed  because  their  respective  influence 
matrices  are  constant  The  leading  matrix 

H(c3)*  m// +  jLc// -  (13) 

is  a  complex  regular  matrix  that  is  invertible  for  all  firequencies  except  for  co  =  0.  How¬ 
ever,  for  0)  =  0,  the  system  undergoes  a  rigid  body  motion,  and  the  leading  matrix  will  be 
determined  only  by  m//  which  is  positive  definite  and  therefore  invertible.  Making  use 
of  the  fact  that  the  leading  matrix  is  invertible  for  all  frequencies,  Eq.  (12)  can  be 
expressed  in  the  following  partitioned  form: 


Qa*  Qht 
<i»(fil)  ’  =  Qih  Qu  Oif 
q^(o))  Grt  Gn  Gb 


l3  -  - 


th(SS)\ 


0  Kcoj-t-  Bp,  Vp(ci))+  F.(a) 
01  1r_|  lF,(a,)| 


where  G(a))  is  the  inverse  of  ri[(ci))  and  I3  is  the  (3  x  3)  identity  matrix.  The  subscript  h 
refers  to  the  rotational  degrees  of  freedom  at  the  hub,  the  subscript  t  refers  to  the  defor¬ 
mation  degrees  of  freedom  at  the  tip  which  are  to  be  controlled,  and  the  subscript  t  refers 
to  the  remaining  elastic  degrees  of  freedom.  The  expression  for  the  influence  matrix  B-t 
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on  the  right  hand  side  of  Eq.  (14)  makes  use  of  the  fact  that  each  of  the  components  of 
the  base  torque  vector  t  is  associated  with  a  specific  rotational  deformation  degree  of 
freedom  and  is  independent  of  the  other  components. 

The  condition  that  the  tip  should  follow  the  nominal  motion  is  equivalent  to  impos- 
g  the  constraint  q,(a))  =  0  for  all  co.  This  constraint  results  in  a  relationship  between  the 
base  torques  z  and  the  distributed  piezoelectric  actuator  inputs  .  This  relationship  can 
be  obtained  from  the  last  set  of  equations  of  Eq.  (14)  when  (i(ci))  is  set  to  zero,  hence 
giving  the  following  result: 

T(co)  =  -GfA“^  Gth  Gti  Gti  "I  Bp  V^(a))  +  F(co)  -  (15) 

where  the  existence  of  the  inverse  of  G,*  is  assured  when  the  torsional  deformation  and 
the  bi-axial  bending  modes  are  decoupled,  which  is  consistent  with  the  assumption  of 
small  deformation  from  the  nominal  configuration.  Substituting  the  above  expression  for 
the  base  torque  t(Ci))  in  Eq.  (13)  and  using  the  Fourier  transform  property  q  =  -co^(J 
yields  the  following  expression  for  the  elastic  displacements  in  compact  form: 

<i/(co)*-^(AV^+B)  (16) 

where 

A  =  [  -  G  B,  (Gf*  G„-  Gff)  +  G]  (17) 

and 

B  =  ( -  G  G,a-‘  (Grt  Grt  G„)  +  G]  (18) 

Employing  Paneval’s  theorem,  minimizing  7  (T,V^)  is  equivalent  to  minimizing  the 
2- norm  //4/(ci))//|  for  each  co.  The  minimization  problem  of  Eq.  (10)  then  reduces  to  a 
standard  least  squares  approximation  problem  with  the  solution 

=  -  (A*A)-1  A*  B  (19) 

where  A*  denotes  the  conjugate  transpose  of  A.  A  necessary  and  sufficient  condition  for 
the  inverse  of  A*A  to  exist  is  that  ail  the  columns  of  the  constant  matrices  B^  and  B^  are 
independent.  This  condition  is  automancally  satisfied  because  the  base  torques  and  the 
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equivalent  concentrated  moment  due  to  the  distributed  piezoelectric  actuators  do  not  act 
on  the  same  degree  of  freedom  in  the  finite  element  model. 

Having  determined  the  piezoelectric  voltages  V(a))  for  each  frequency  w  from  Eq. 
(19).  the  base  torques  x(co)  and  the  elastic  displacements  4/(co)  can  be  determined  from 
Eqs.  (15)  and  (16),  respectively,  for  the  same  frequency.  The  base  torques,  elastic  dis¬ 
placements,  and  piezoelectric  voltages  do  not  need  to  be  determined  for  the  frequency 
03  =  0  because  the  zero  frequency  content  of  these  variables  can  be  determined  from  the 
zero  initial  conditions.  The  base  torques,  elastic  displacements,  and  the  piezoelectric  vol¬ 
tages  in  the  time  domain  can  then  be  obtained  by  using  their  respective  inverse  Fourier 
transforms 

OO 

x(0  =  -^f  T(03)e‘“d0}  (20) 

q/(0  = I  <l/(03)e‘“  dto  (21) 

-OO 

and 

Vp(f)  =  -2^7  V^(u)«‘“d5.  (22) 

We  note,  however,  that  the  forcing  vector  F(r)  in  Eq.  (8)  depends  on  the  elastic 
nodal  deformations  and  nodal  velocities.  Therefore,  an  iteration  p.ocess  is  needed  to 
obtain  the  solution  to  the  nonlinear  differential  equations.  We  stan  the  iteration  process 
by  assuming  zero  elastic  deformations  and  velocities  for  the  initial  calculation  of  the 
forcing  vector  F(r)  and  use  a  successive  substitution  scheme  to  converge  to  the  correct 
solution.  For  normal  robotics  applications,  convergence  is  achieved  in  two  or  three  itera¬ 
tions. 

Once  the  base  torques  and  the  elastic  deformations  have  been  determined,  the  reac¬ 
tions  coming  from  the  next  multibody  component  in  the  proximal  direction  can  be  deter¬ 
mined  from  dynamic  equilibrium  considerations.  The  reaction  forces  between  com¬ 
ponents  will  generally  consist  of  three  force  components  and  three  torque  components. 
As  shown  in  Fig.  2,  the  three  torque  components  at  the  base  for  body  j  consist  of  a 
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revolute  joint  actuating  torque  t,.  a  reaction  bending  moment  T,  that  controls  the  end¬ 
point  displacement  5f  in  the  plane  defined  by  the  revolute  joint  axis  and  the  component's 
body  axis,  and  a  reaction  torsional  moment  t,  which  controls  the  end-point  torsional 
deformation  dr  for  this  multibody.  Having  determined  the  reaction  forces  and  moments 
between  this  multibody  and  the  next  multibody  component  in  the  proximal  direction,  the 
latter  multibody  can  be  analyzed  using  the  inverse  dynamics  procedure  discussed  above. 
This  multibody  component  has,  at  its  distal  end,  the  previously  computed  reaction  forces 
and  reacuon  moments  coming  from  the  previous  multibody  component.  These  forces  and 
moments  are  transferred  firora  the  previous  multibody  component  to  the  present  multi¬ 
body  by  a  projection  of  forces  and  moments,  where  the  projection  matrix  depends  on  the 
rigid  body  coordinates.  The  inverse  dynamics  analysis  therefore  involves  a  procedure 
which  analyzes  each  multibody  body  component,  starting  from  the  end  of  the  chain  and 
proceeding  towards  the  base  of  the  chain.  For  each  multibody  component,  the  case 
torques  are  determined  such  that  the  end-point  follows  the  desired  trajectory  and  in  addi¬ 
tion,  the  reaction  forces  and  moments  at  the  end-point  are  in  dynamic  equilibrium  with 
the  previously  computed  base  torques  and  base  forces  of  the  adjacent  multibody  com¬ 
ponent  in  the  distal  direction.  Finally,  the  base  torques  of  the  multibody  component  at  the 
base  of  the  chain  are  projected  onto  a  reference  frame  that  defines  the  joint  axes  of  the 
three  motors  at  the  ground.  The  three  torque  components  that  result  from  this  projection 
are  the  required  base  motor  torques  at  the  ground. 

To  summarize,  the  procedure  for  obtaining  the  inverse  dynamic  torques  ar  distri¬ 
buted  piezoelectric  voluges  that  will  simultaneously  track  a  desired  end-point  trajectory 
and  minimize  elastic  vibrations  in  open-chain  flexible  manipulators  involve  the  following 
steps: 
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Algorithm: 

1.  Define  the  nominal  motion  (rigid  body  inverse  kinematics), 

2.  For  each  link  in  the  chain,  starting  from  the  last  link: 

a)  Determine  the  piezoelectric  voltages  for  this  link. 

b)  Determine  the  base  torques  for  this  link. 

c)  Determine  the  elasdc  displacements,  velocities  and  accelerations. 

d)  Repeat  steps  (a),  (b),  and  (c)  until  the  base  torques  and 
voltages  converge  to  within  a  desired  tolerance. 

e)  [determine  the  reaction  forces  to  the  next  link  by  considering 
dynamic  equilibrium  for  this  link. 

3.  Proceed  to  the  next  link  in  the  chain. 

The  recursive  approach  presented  above  solves  the  trajectory  tracking  problem  one 
link  at  a  dme.  In  general,  there  may  be  global  solutions  to  the  nonlinear  inversion  protv 
lem  associated  with  end-effector  trajectory  tracking,  which  can  potentially  have  many 
solutions.  Our  method,  however,  yields  a  solution  which  can  be  efficiently  computed  due 
to  the  linearization  process  that  takes  place  when  the  ends  of  each  link  are  constrained  to 
move  along  their  respective  nominal  trajectories. 

In  the  next  section,  we  consider  an  example  of  a  class  of  spatial,  open-chain,  flexi¬ 
ble  manipulaton  where  all  intermediate  joints  are  revolute  joints  and  all  intermediate 
joint  axes  are  parallel  in  the  nominal  configuration.  For  this  class  of  flexible  manipula¬ 
tors,  at  least  three  motors  are  required  at  the  ground  in  order  to  achieve  end-point  track¬ 
ing  for  all  possible  configurations.  We  illustrate  the  preceding  statement  through  a  simple 
example  involving  a  single-link  flexible  manipulator  supported  by  a  rotating  base  and 
controlled  by  only  two  motors  at  the  ground,  as  shown  in  Fig.  3.  When  the  manipulator  is 
in  the  vertical  position,  the  end-point  displacement  5t  can  not  be  controlled  by  two 
ground  mourn  alone.  This  is  so  because  as  the  manipulator  gets  closer  to  the  vertical 
position,  the  torque  that  is  required  to  control  this  displacement  component  becomes  so 
large  that  it  saturates  the  capacity  of  the  ground  motor.  Likewise,  when  the  manipulator 
is  in  the  horizontal  position,  the  end-point  rotational  displacement  0^  can  not  be  con¬ 
trolled  by  the  two  moton  alone.  Hence,  a  minimum  of  three  motors  at  the  ground  is 
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necessary  to  assure  that  the  end-point  elastic  displacements  are  controllable  for  all  possi¬ 
ble  configtirations.  The  requirement  of  having  three  motors  at  the  ground  is  not  restrictive 
because  it  does  not  increase  the  mass  that  is  distributed  on  the  links,  and  therefore  this 
requirement  does  not  increase  the  inertial  forces  acting  on  the  system.  The  simple  algo¬ 
rithm  presented  herein,  and  the  minimal  number  of  motors  required  to  implement  the  tra¬ 
jectory  tracking  procedure  should  make  the  proposed  approach  very  attractive  in  the 
design  and  experimentation  of  spatial,  open-chain,  flexible  manipulators. 

3.  Simulation  Results  and  Discussion 

The  recursive  procedure  discussed  in  the  preceding  section  is  applied  to  the  three- 
dimensional  open-chain  flexible  manipulator  shown  in  Fig.  4  to  demonstrate  the  validity 
of  the  proposed  procedure  for  solving  the  simultaneous  end-point  trajectory  tracking  and 
vibration  reduction  problem.  The  desired  modon  is  to  have  the  end-point  remain  in  the 
X2-Xi  plane  with  the  xi  coordinate  and  X3  coordinate  of  the  end-point  following  the  tra¬ 
jectories  shown  in  Fig.  5.  The  two  links  share  the  following  geometric  and  material  pro¬ 
perties: 

Length:  1.0  m 

Cross  section  dimensions:  1.0  cm  x  1.0  cm 
Young’s  modulus:  70  GPa 
Shear  modulus:  27  GPa 
Mass  density:  2715  kg /m^ 

Tip  mass:  0.5  kg 

The  piezoelectric  actuators  are  distributed  uniformly  on  the  middle  third  span  of 
each  link.  This  placemens  of  the  distributed  actuators  assures  that  the  first  mode  of  vibra¬ 
tion  is  minimized.  For  the  example  considered  herein,  the  equivalent  concentrated 
moment  that  results  from  the  applied  voltage  to  the  piezoelectric  actuator  is 

Mp=0.1Vp.  (23) 

We  perform  two  sets  of  computations  for  the  example  considered:  1)  inverse 
dynamic  torques  acting  alone;  and  2)  inverse  dynamic  torques  applied  together  with 
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distributed  piezoelectric  actuators.  We  can  compare  the  results  of  the  two  separate 
numerical  experiments  to  illustrate  the  effectiveness  of  the  distributed  actuators  in 
minimizing  elastic  vibrations  while  simultaneously  tracking  a  desired  end-point  trajec¬ 
tory. 

Plots  of  the  inverse  dynamic  motor  torques  needed  to  track  the  desired  end-point 
trajectory  are  shown  in  Figures  6-9.  In  these  plots,  the  solid  curve  refers  to  the  inverse 
dynamic  torques  that  are  needed  when  they  are  acting  alone,  and  the  dashed  curves  refer 
to  the  required  inverse  dynamic  joint  torques  that  are  acting  together  with  piezoelectric 
actuators.  We  take  note  of  the  observation  that  the  inverse  dynamic  torque  profiles  exhi¬ 
bit  pre-actuation  and  post-actuation  with  respect  to  the  end-point  motion  for  both  sets  of 
curves.  We  also  note  that  the  inverse  dynamic  torques  are  only  slightly  perturbed  by  the 
presence  of  the  distributed  piezoelectric  actuators. 

Figures  10  and  11  show  the  elastic  deformations  along  the  two  transverse  directions 
at  the  proximal  third  point  in  each  of  the  two  links.  The  solid  curve  refers  to  the  elastic 
deflection  caused  by  the  joint  actuators  acting  alone,  and  the  dashed  curves  refer  to  the 
elastic  deflection  that  results  when  joint  torques  are  acting  together  with  the  distributed 
piezoelectric  actuators.  The  figures  show  that  the  distributed  actuators  can  significantly 
reduce  the  elastic  vibrations  that  are  induced  by  the  slewing  motion.  Plots  of  the 
piezoelectric  voltages  that  are  required  to  obtain  the  above  reductions  in  the  elastic  vibra¬ 
tions  are  shown  in  Figure  12. 

4.  Conclusion 

We  have  presented  a  new  recursive  computational  procedure  for  determining  the 
inverse  dynamic  torques  wd  distributed  piezoelectric  actuator  voltages  that  are  needed  to 
simultaneously  achieve  end-point  trajectory  tracking  and  vibration  minimization  in 
three-dimensional,  open-chain  flexible  manipulators.  The  trajectory  tracking  requirement 
is  achieved  for  the  three-dimensional  case  with  the  use  of  a  single  actuator  at  each  inter¬ 
mediate  joint,  as  opposed  to  previous  approaches  where  three  actuators  have  been  pro¬ 
posed  at  each  joint  The  non-causal  inverse  dynamic  torques  and  piezoelectric  voltages 
are  obtained  through  integration  in  the  frequency  domain.  The  transformation  of  the 
equations  of  motion  into  the  frequency  domain  is  greatly  simplified  by  expressing  the 
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base  torques  in  terms  of  components  along  the  associated  floating  reference  frame  for 
each  multibody  component.  End-point  displacements  in  the  plane  defined  by  the  associ¬ 
ated  rcvolute  joint  axis  and  the  body  axis  can  be  controlled  by  treating  the  base  torque 
controlling  this  displacement  as  an  unknown  internal  reaction  moment  which  is  to  be 
determined.  Through  a  recursive  procedure,  the  actuating  torques  at  the  proximal  revo¬ 
lute  joints  are  determined  so  that  the  desired  internal  reaction  moments  described  above 
are  attained.  Finally,  the  minimal  number  of  motors  required  at  the  intermediate  joints  to 
implement  the  tracking  control  procedure  substantially  reduces  the  control  effort  and 
weight  of  the  manipulator,  and  this  is  an  important  contribution  in  the  design  and  experi¬ 
mentation  of  spatial,  open-chain,  flexible  manipulators. 

Further  research  is  needed  to  address  the  problem  of  simultaneous  trajectory  track¬ 
ing  and  vibration  minimization  of  general  (open-chain  or  closed-chain)  three- 
dimensional  flexible  manipulators.  The  sensitivity  of  the  method  to  modeling  errors 
needs  to  be  addressed  as  well. 
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Introdnction 

CTIVE  damping  and  control  of  flexible  struaures  has 
been  an  area  of  research  focus  for  some  time.*  However, 
the  recent  application  of  distributed  piezoelectric  actuators  to 
structure  control  by  Crawley  and  De  Luis^  and  Bailey  and 
Hubbard’  has  posed  new  and  challenging  problems.  Following 
the  initial  expeilments  of  these  researchers,  where  a  single 
vibrational  mode  is  controlled,  Fisher^  addressed  the  actuator 
placement  problem  to  control  several  modes. 

In  this  Note  we  address  the  new  problem  of  buckling  control 
using  smart  materials.  In  contrast  to  the  dynamic  stability 
issues  of  vibration  control,  buckling  is  a  static  instability  of 
axially  loaded  members  of  a  structure.  It  is  well  known  that  as 
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the  axial  compressive  load  P  in  an  initially  straight  beam  in¬ 
creases,  the  beam  remains  straight  and  undeformed  until  the 
load  reaches  a  certain  critical  value  Per.i,  where  the  stable 
equilibrium  of  the  Hrst  bending  mode  bifurcates  into  one  un¬ 
stable  and  two  stable  equilibria  (pitchfork  bifurcation).  The 
two  stable  equilibria  correspond  to  buckled  configurations. 

Here  we  use  piezoelectric  actuators  and  strain  gauge  sensors 
to  show  that  buckling  of  a  simply  supported  beam  can  be 
postponed  beyond  the  first  critical  load.  The  load  deflection 
characteristic  for  large  deflections  of  a  beam  in  a  buckled 
configuration  is  highly  nonlinear  and  involves  numerical  solu¬ 
tion  of  elliptic  integrals.  Figure  la  shows  a  typical  load  deflec¬ 
tion  curve  where  P„.„  is  the  buckling  load  of  the  nth  mode. 
If  P<Pa,|,  the  undefleaed  beam  is  stable.  For  P„,„<P 
<Pa,K*i  ^  modes  are  stable  except  for  the  first  n  bending 
modes.  The  idea  reflected  in  this  Note  is  the  use  of  feedback 
control  in  conjunction  with  piezoactuators  to  stabilize  the  first 
bending  mode  beyond  P„.|  and  achieve  a  bifurcation  diagram 
of  the  form  shown  in  Fig.  lb,  where  the  buckling  force  P„.i  is 
greater  than  that  for  the  uncontrolled  beam. 

We  begin  by  deriving  the  linearized  equation  of  motion  and 
the  associated  modal  equations  of  a  simply  supported  flexible 
beam  with  piezoelectric  actuators  subject^  to  slowly  varying 
axial  load.  This  is  followed  by  the  state-space  model  of  the 
reduced  order  system  and  the  design  of  a  controller  to  increase 
the  stiffness  or  impedance  of  the  first  bending  mode.  We  dis¬ 
cuss  the  effect  of  the  unmodeled  higher  order  residual  modes 
and  methods  of  reducing  this  effect.  Our  conclusions  are  made 
in  the  last  section. 
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Fig.  1  Load  deflection  enrve  of  a)  nneontroUed  beam  and  b)  con¬ 
trolled  beam. 


Fig.  2  Simply  supported  column  with  piezoelectric  actnators. 


System  Model  of  a  Beam  with  Piezoactnators 

In  this  seaion  we  use  a  truncated  modal  expansion  of  the 
deflection  of  a  beam  to  derive  a  linear  finite-dimensional 
model.  We  emphasize  that  the  beam  is  assumed  to  be  uniform 
with  no  manufacturing  imperfections.  Since  the  aim  is  to  stabi¬ 
lize  the  beam  in  its  straight  configuration,  it  is  natural  to 
assume  small  deflections  and  linearize  the  equations  of  motion 
about  this  configuration.  Note  that  the  strain  induced  by 
piezoelectric  actuators  is  usually  small  and,  therefore,  the 
small  defleaion  assumption  is  consistent  with  the  capacity  of 
the  aauators. 

Figure  2  shows  a  simply  supported  uniform  beam  with 
piezoelearic  aauators  of  equal  thickness  bonded  to  both  sides 
by  a  suitable  adhesive.  The  beam  of  width  b  and  thickness  tt 
is  subjected  to  an  axial  compressive  load,  and  control  moments 
are  applied  by  the  piezoaauators.  The  aauator  being  modeled 
is  a  piezoelearic  polymer,  poly  vinylidene  fluoride.  For  an 
axially  polarized  piezo,  a  voltage  applied  aaoss  its  thickness 
results  in  strain  aJong  its  length.  For  simplicity  the  width  of 
each  piezolayer  is  assumed  to  be  the  same  as  that  of  the  beam. 

The  strain  A/  developed  in  an  unconstrained  piezo  is  given 
by  A,  =»,(r)d3]/rp  where  v,(r);  1,  2is  the  voltage  applied  to 

the  ;th  piezostrip,  dn  the  piezoelectric  strain  constant,  and  tp 
the  thickness  of  the  piezolayer.  If  vi  and  v:  are  the  voltages 
applied  on  the  top  and  bottom  piezolayers,  resTwrrivelv-  and 
Ep  IS  the  Young’s  uiOdulus  of  the  piezo,  the  resulting  mouient 
on  the  piezobeam  segment  is  given  by 

Af  =  h£prp(A2-A|)(^+f.  +  ^) 

=  bEpd,,{^  +  /,  +  V,)  k  ~  V,)  (1) 

The  equation  of  motion  of  the  beam  can  be  derived  using 
Hamilton’s  variational  principle.^  Under  small  defleaion  as¬ 
sumption  Hamilton’s  principle  yields 

pAy  +  (Ely')’  +  (Py'Y  ^  M[6’ (x  -  Xi)  -  6' (x  -  x^)]  (2) 

where  p  is  the  density  of  the  beam;  y  is  the  transverse  deflec¬ 
tion;  y  and  y '  are  the  time  and  spati^  derivatives  of  y ,  respec¬ 
tively;  El  is  the  stiffness  of  the  beam;  A  is  the  aoss-sectional 
area  of  the  beam;  i'  is  the  spatial  derivative  of  the  delta  func¬ 
tion;  and  Xi  and  xj  are  the  locations  of  the  two  ends  of  the 
piezolayer.  The  solution  to  Eq.  (2)  is  obtained  as  follows. 

Unforced  Dynamics 

The  unforced  system  dynamics  are  deflned  by  the  condition 
Af  =  0  in  Eq.  (2).  Using  separation  of  variables,  we  have 
y„(x./)«0,(x)ij„(r).  Substituting  this  in  Eq.  (2)  and  assuming 


a  simply  supported  uniform  beam  (El  constant)  of  length  L , 
we  obtain 

n,{r)  =  sin(u)„f +  ^1-)  and  <t>„(x)  =  D„%iTi(n-wx/L) 

/t  =  1,2,3...  (3) 

if  is  normalized  so  that  dx  =  1,  the  unforced  dynamics 
in  modal  form  becomes 

y„(x,l)  =  'Jl/L  sin(/iTX/Z.)sin(<i)„/  +  (4) 

To  see  how  the  load  affeas  the  natural  frequencies  of  the 
beam,  substitute  Eq.  (4)  in  Eq.  (2)  and  simplify  to  get 


u 


2 

n 


1  nV 
pA 


(5) 


From  Eq.  (S)  we  see  that  the  nth  pole  pair  ±jmp  moves  along 
the  imaginary  axis  toward  the  origin  as  P  inaeases  from  0 
to  4  Eln}iP/L}  and  becomes  real  for  P^Pa,„.  Hence, 
when  P=Pa,„  the  beam  loses  all  of  its  stiffness  and  buckles  in 
the  nth  buckling  mode. 


Forced  Response 

The  deflection  of  the  beam  can  be  written  in  terms  of  the 
modal  deflections  as 

y{Jf.O=  E 

flm  1 

Substituting  this  in  Eq.  (2)  we  obtain 


2  0A4>n'rip  +  S  +  L 

1  ft*  I  n •  i 


-  A/[6'(x-X2)-5'(x-x,)] 


(6) 


(If  the  piezoelectric  aauators  are  bonded  along  the  total  length 
of  the  beam,  Xi  =  0  and  Xi  =  L.)  Multiplying  both  sides  of 
Eq.  (6)  by  ,  integrating  with  respea  to  x  over  the  beam 
length,  and  using  the  orthogonality  of  the  mode  shapes  yields 


in  +  = 


-2nrAf  fJ 
pAL  yL 


1 

0 


if  n  is  odd 


if  n  is  even 


(7) 


If  >'2=-i'i  =  >>,  we  have  and  Eq.  (7)  becomes 

where 


-An-wk* 

pAL 


if  n  is  odd 

if  n  is  even 


(8) 


Since  the  n  even  modes  are  uncontrollable,  we  expea  the  beam 
to  buckle  in  the  second  mode  when  P^Pa  ^. 

Many  control  problems  aimed  at  vibration  suppression  use 
piezoelectric  materials  as  sensors  in  the  feedback  loop.  How¬ 
ever,  due  to  charge  leakage,  piezoelearics  are  not  useful  as 
sensors  near  0  Hz  as  required  in  this  application.  Therefore, 
resistive  strain  gauge  sensors  are  used  as  modeled  in  the  next 
seaion. 


Sensor  Modding 

Modal  states  are  estimated  from  strain  gauge  measurements 
at  disaete  locations.  It  is  easy  to  see  that  observability  of  the 
modes  of  the  system  depends  on  the  location  of  the  sensors;  a 
mode  with  its  node  at  the  location  of  a  strain  gauge  is  unob¬ 
servable  with  that  sensor.  To  reduce  the  number  of  sensors, 
modal  control  of  flexible  struaures  is  usually  based  on  the  flrst 
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few  modes  of  vibration.  This  is  justified  by  the  fact  that  higher 
vibrational  modes  are  in  general  difficult  to  excite  and  have 
higher  structural  damping.  However,  the  unmodeled  dynamics 
can  cause  instability  through  what  are  known  as  control  and 
observation  spillover.  It  has  been  shown'  that  both  control  and 
observation  spillover  of  unmodeled  modes  are  necessary  to 
cause  instability  in  a  closed-loop  system. 

The  sensors  are  placed  so  that  the  second  and  third  modes 
and  their  multiples  are  unobservable.  Thus  the  first  and  fifth 
modes  are  the  first  two  modes  in  a  minimal  realization  of  the 
system.  We  ignore  higher  order  modes  and  discuss  the  associ¬ 
ated  spillover  problems  later.  If  a  small  amount  of  structural 
damping  is  presei.t,  all  of  the  unobservable  modes  remain 
stable  even  in  the  presence  of  spillover.  Similarly  the  dynamics 
of  the  even  modes  are  not  affected  by  the  control,  and  hence 
they  remain  stable. 

We  model  the  sensors  as  follows.  The  bending  moment  at 
the  location  of  a  strain  gauge,  a  distance  x  from  the  left  end  of 
the  beam,  is  given  by’ 

.Vf»  =  Eft/e,  ^  E 


where  £/,  is  the  Young’s  modulus  of  the  beam  material  and 
is  the  equivalent  moment  of  inertia  of  the  composite  piezo¬ 
beam  segment  based  on  beam  material.  The  resulting  strain  in 
a  strain  gauge  attached  to  one  side  of  the  beam  is 


E'ftfeq  A  eqE^ 


(10) 


where  is  the  equivalent  area  based  on  beam  material.  The 
sign  of  the  first  term  in  Eq.  (10)  depends  on  which  side  of  the 
beam  the  strain  gauge  is  bonded  to.  Therefore,  the  output  of 
a  differential  strain  gauge  is  independent  of  the  load  P  and  is 
given  by 


where  k,  is  the  strain  gauge  constant.  If  differential  strain 
gauges  are  placed  at  x  =  L /3  and  x^2L /3,  and  the  sum  of 
their  measurements  is  taken  as  the  system  output,  we  have 


=  k,  ^  n sin  —  +  sin 


¥) 


(12) 


Controller  Design 

If  n  modal  amplitudes  and  their  rates  are  taken  as  the 
states  of  the  system,  the  state-space  representation  of  the 
2n  dimensional  reduced  order  model  with  a  state  vector 
>ir  =  (Ti  It  Vn  becomes 

T|,  = +  £,»  and  (13) 


first  and  the  fifth  modes  are  included  in  the  reduced  order 
model  and  a  structural  damping  coefficient  f  is  assumed,  then 

0  0 

0  0 

0  1 

“  Utj  ~  2^Ut3 

!  V3  0  1 

I  0  v3 

=  J  r- 

1  -25vl  0 

'  -I 

;  0  -  25V3 

u  J 

(14) 

A  controller  is  designed  using  standard  linear  quadratic  regu¬ 
lator  (LQR)  design  to  minimize  a  cost  funaional  of  the  form 

y  =  j  lcnt^Qyir  +  v^Ry)dt  (15) 

where  Q  and  R  are  positive  semidefinite  and  positive  definite 
weighting  matrices,  respectively;  and  a  is  a  scalar.  The  solution 
of  the  corresponding  Riccati  equation  in  this  method  gives  an 
optimal  state  feedback  solution  of  the  form  »(t)=  -KrVrd), 
where  Kr  is  a  constant  feedback  gain  matrix.  The  following 
parameters  were  used  for  simulation. 

Beam  properties: 


Ar  = 


-a>, 

0 

0 


1 

-  2ftj| 
0 
0 


-4rk 


pAL 


■-i 


CJ 


bi,  =  25.4  mm  f*  »  1  mm  £»  =  5  GPa 
Pi,  -  1CX)0  kg/m’  Lt,  =  152.4  mm 

Piezoactuaor  properties: 

bp  =  bi  fp  -  ^  10  Mttt  Ep  =2  GPa  Lp  =  Lp 

Pp-  1780  kg/m’  dj,  =23x10- ’’m/V 

A  strain  gauge  constant  tk,  =0.01  V/p  strain  and  structural 
damping  coefficient  f = 0.01  are  assumed.  The  thickness  of  the 
adhesive  layer  is  neglected. 

The  optimal  feedback  gain  matrix  K,  is  computed  for 
£=4.1/>„.i  using  Q^CfCr,  R  =  l,  and  a  =  0.05.  The  first 
mode  is  stabilized  at  a  load  exceeding  4Pcr.i  ttnd  at  this 
load  the  uncontrolled  second  mode  is  unstable  (i.e.,  the  beam 
is  forced  to  buckle  in  the  second  mode).  However,  it  remains 
to  check  that  this  controller  stabilizes  the  system  for  loads  less 
than  4. 1  Etr.i .  For  the  reduced  order  system  this  problem  can  be 
reduced  to  checking  the  roots  of  a  fourth-degree  interval  poly¬ 
nomial  with  each  coefficient  varying  monotonically  when  the 
load  varies  from  £  =  0  to  P  =  4.1 The  stability  of  the 
system  under  any  fixed  axial  load  Ps  4.1  Pcr.i  was  verified  by 
checking  the  stability  of  the  two  Kharitonov  polynomials’ 
associated  with  the  fourth-order  characteristic  equation  of  the 
system.  The  same  robustness  result  can  also  be  obtained  nu¬ 
merically  using  a  root  locus  plot  parameterized  by  P. 

The  resulting  closed-loop  response  to  nonzero  initial  condi¬ 
tions  and  the  control  input  voltage  to  the  actuators  for  the 
controlled  model  with  a  load  of  P»3.8P„,i  are  shown  in 
Fig.  3a.  The  effect  of  the  unmodeled  dynamics  and  methods  of 
reducing  this  effect  is  the  subject  of  the  next  section. 


where  A,  is  a  2nx2n  block  diagonal  matrix,  5,  is  a  2n  x  1 
input  matrix,  and  p  is  input  to  the  system.  We  assume  that  the 
derivative  of  the  sensor  output  can  be  computed  and  define 
=  ("o  "ol;  and  C,  is  a  2x 2n  output  matrix.  If  only  the 


Numerical  Evaluation  of  Spillover 

To  see  the  effect  of  spillover,  the  same  gain  K,  is  used  with 
an  extended  evaluation  model  containing  modes  7  and  1 1  in 
addition  to  modes  1  and  5.  To  reduce  the  effect  of  spillover  the 
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Fig.  3  Closed-loop  response  to  nonzero  initinl  deflection 
iP  =  3.8/’cr.i):  1)  reduced  order  model  snd  b)  extended  evalnition 
model. 


controlled  modes  are  reconstructed  from  the  output  of  the 
extended  model  using  a  comb  Tilter  selecting  modes  1  and  S 
(an  observer  serves  this  purpose)  and  these  estimates  are  used 
for  feedback. 

The  resulting  closed-loop  response  of  the  augmented  system 
to  nonzero  initial  conditions  and  the  required  input  voltage  to 
the  actuators  are  shown  in  Fig.  3b.  From  Figs.  3a  and  3b  it  can 
be  seen,  as  expected,  that  there  is  no  significant  effect  of  the 
uncontrolled  modes  on  the  dynamics  of  the  controlled  modes. 

Conclusion 

In  this  Note  we  addressed  the  problem  of  buckling  control 


using  sman  materials,  a  static  instability  of  axially  loaded 
members  of  a  structure.  We  showed  that  the  buckling  of  a 
flexible  beam  can  be  piostponed  beyond  the  first  critical  load 
by  means  of  feedback  using  piezoelectric  aauators  and  strain 
gauge  sensors.  It  is  observed  that  a  controller  design  based  on 
a  fixed  axial  load  stabilizes  the  modeled  modes  for  any 
P^Pmix  and,  therefore,  is  robust  to  slow  load  variations. 
Hence  buckling  in  the  first  mode  is  inhibited,  and  the  beam 
can  support  a  load  up  to  the  second  critical  load.  Actuator 
and  sensor  placement  is  discussed  with  regard  to  problems  of 
spillover.  Finally,  spillover  has  not  posed  serious  problems 
as  we  are  able  to  design  the  controller,  in  the  case  of  a  beam, 
using  a  low-order  model  and  verify  stability  for  a  high-order 
model. 
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Pasadena,  California 

Abstract 

This  paper  addresses  the  inversion  of  a  nonlinear  system 
of  the  form  x=f  (x)+g  (x)u,  y=h(x)  from  the  perspective  of 
nonlinear  geometric  control  theory.  We  use  the  nodon  of  zero 
dynamics  for  obtaining  stable,  though  noncausal,  inverses  for 
nonminimum  phase  systems.  This  contrasts  with  the  causal 
inverses  proposed  by  Hirschom  where  unstable  zero  dynamics 
result  in  unbounded  solutions  to  the  inverse  problem.  Our  results 
reduce  to  those  of  Hirschom  in  the  case  of  stable  zero  dynamics, 
however.  A  numerical  example  is  described  and  the  input  gen¬ 
erated  using  inversion  is  compared  with  that  produced  using 
recent  results  in  nonlinear  reguiadon. 

Key  Words;  inversion,  output  tracking,  nonlinear  systems,  non- 
minimum  phase. 

1.  Introduction 

Output  tracking  control  for  nonlinear  systems  is  a  chal¬ 
lenging  problem  encountered  in  the  control  of  articulated  flexi¬ 
ble  space  structures,  flexible  manipulators  and  elsewhere.  There 
are  two  basic  approaches  to  this  problem.  First,  asymptotic 
tracking  control  using  feedback  and,  second,  inversion  coupled 
with  stabilization.  The  second  approach  offers  the  potendal  of 
exact  output  tracking  without  transients,  but  introduces  the  prob¬ 
lem  of  inverting  nonlinear  systems  with  unstable  zero  dynamics. 
.Motivated  by  new  results  in  the  control  of  ardculated  flexible 
structures  [1,  2.  3],  we  address  the  second  approach  and  solve 
the  inversion  problem  for  a  class  of  nonlinear  systems  with 
unstable  zero  dynamics. 

For  the  linear  multivariable  case,  the  asymptodc  tracking 
problem  was  solved  by  [4.  5]  and  subsequently  crystallzed  as 
the  Lntemal  model  principle  [6].  The  matrix  equadons  defining 
an  asymptotic  tracking  controller  for  linear  systems  were 
translated  to  nonlinear  pardal  differendal  equadons  in  the  non¬ 
linear  case  (see  Isidori  and  Byrnes  (7]).  Although  nonlinear 
PDE's  are  only  numerically  tractable  for  systems  of  low  order, 
soludons  for  tracking  periodic  Tajectories  have  developed  based 
on  Fourier  series  [3,  9].  Transient  response  remains  a  problem, 
however. 

Transient  behavior  can  be  precisely  controlled  using  sta¬ 
bilizing  feedback  together  witn  feed-forward  generated  b''  an 
inverse  system.  For  linear  muldvariable  systems  the  inversion 
problem  has  been  resolved  to  a  large  degree  by  Brockeit  and 
.Mesarovic  [10]  and  Silverman  [llj.  However,  these  inverses 
are  ail  causal.  Condiuons  for  the  invenibility  of  nonlinear  real- 
analytic  systems  have  been  derived  by  Hirschom  (12.  13|.  Here 
again,  only  causal  inversion  is  addressed.  Deeply  rooted  in 
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these  inversion  results  is  the  nodon  of  reladve  degree  which  is 
important  in  our  work  also  (although  a  clear  exposition  of  rela¬ 
dve  degree  for  nonlinear  systems  is  fairly  recent  [14]). 
Although  stable  inversion  has  not  been  studied  in  the  framework 
of  Brockett  and  Mesarovic  and  Hirschom,  approximate  non¬ 
causal  inversion  has  been  used,  for  example,  in  linear  quadradc 
optimal  tracking  [IS],  etc.. 

In  this  chapter  we  derive  bounded  inverses  to  nonlinear 
systems  in  an  effort  to  find  feed-forward  signals  for  tracking 
controllers.  For  systems  with  unstable  zero  dynamics  these  are 
necessarily  noncausal.  These  results  are  new  for  linear  as  well 
as  nonlinear  systems  and  were  modvated  by  the  successful 
inversion  of  a  flexible  manipulator  model  by  Bayo  et  al.  [2] 
using  iteradve  linearization  and  solution  with  the  Discrete 
Fourier  Transform.  Here  we  seek  a  geometric  interpretadon  and 
time-domain  soludon  of  the  inversion  problem  for  the  system 

x=f(.x)^S(x)u 

y=h(x). 

We  show  that  inversion  is  equivalent  to  solving  a  two-point 
boundary  value  problem  in  the  general  case  of  unstable  zero 
dynamics.  The  stable  and  unstable  manifolds  of  the  zero 
dynamics  play  a  central  role  in  the  ideas  developed. 

The  remainder  of  the  chapter  is  organized  as  follows.  In 
the  next  secdon  we  define  the  class  of  reference  trajectories 
under  consideradon  and  state  the  problem  of  stable  inversion. 
Secdon  3  contains  the  main  result  and  shows  that  the  stable 
inversion  problem  reduces  to  a  two-point  boundary  value  prob¬ 
lem  of  reduced-order  ordinary  differential  equations.  Section  4 
contains  an  example  of  a  fourth  order  nonlinear  nonminimum- 
phase  system;  our  stable  Inversion  approach  is  worked  out  in 
detail  and  compared  to  the  approximate  inverse  obtained  from  a 
nonlinear  regulator.  Slmuladon  results  demonstrate  the  value  of 
our  inverse  for  dead-beat  output  tracking. 

2.  Framework  and  Problem  Statement 

We  consider  a  nonlinear  system  of  the  form 

x=f(x)  +  g{x)u  (1) 

y=h{x).  (2) 

defined  on  a  neighborhood  X  of  the  origin  of  R",  with  input 
u  e  R"  and  output  y  6  R'’.  The  functions  f{x),  g,  (x)  (the 
i  th  column  of  g(x))  i  =  1.  2.  ■  ■  ■  .  m  zra  smooth  vector  fields 
and  h;(x)  for  i  =  1.  2.  ■  ■  •  .  p  ate  smooth  functions  on  X ,  wuh 
/(O)  =  0  ind  h(0)  =  0. 
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In  ihe  context  of  the  above  system  pose  the  following 

Stable  Inversion  Problems  Given  a  smooth  reference  output  tra¬ 
jectory  yj(r)  with  compact  support,  find  a  control  input  u^(t) 
and  a  sute  trajectory  x^(t)  such  that 

1)  Ui  and  satisfy  the  differential  equation 

=  (3) 

2)  exact  output  tracking  is  achieved: 

h{xAi))  =  yAc).  (4) 

3)  uj  and  are  bounded  and 

tii(r)— »0,  xj(t)—*0  as  r— »±oo.  (5) 

Note  that  here  we  require  yj(t)  to  have  compact  support, 

that  is,  there  exist  to  and  tf  such  that  y^(r)  =  0  for  all  r  ^  to  and 
ail  t  >tf.  However,  the  development  in  this  chapter  can  be 
extended  with  little  effort  to  cover  desired  trajectories  whose 
first  derivatives  have  compact  support.  The  extension  covers  a 
large  class  of  realistic  trajectories. 

We  call  X4  the  desired  state  trajectory  and  U4  the  nominal 
control  input.  These  can  be  incorporated  into  a  dead-beat  con¬ 
troller  by  using  the  nominal  control  input  as  a  feed-forward  sig¬ 
nal  and  X  ~  X4  as  an  error  signal  for  feedback.  The  design  of 
the  feedback  compensator  has  no  general  solution  yet,  but  con- 
tfoUers  for  specific  systems  have  been  developed  (see  e.g.  Paden 
et  a/.  [1]). 

In  solving  for  the  nominal  trajectories  X4  and  the  con¬ 
cepts  of  stable  and  unstable  manifolds  of  an  equilibrium  point 
arise  naturally  [16],  For  the  sake  of  completeness  we  review  the 
definitions  here.  Let  z  =  0  be  an  equilibrium  point  of  an  auto¬ 
nomous  system  defined  in  a  open  neighborhood  (/  of  the  origin 
of  R’; 

r  =/(r).  (6) 

and  6,(z)  be  the  flow  passing  through  z  at  f  =  0.  We  define  the 
(local)  stable  and  unstable  manifolds  W ,  W*  as  follows: 

W  =  (z  €  £/lb,(z)  e  U  \/i  20.(1, (r)-*0  as  r-*oo)  (7) 
=  (z  €  f/l<S,(z)6  U  Vf  2  0.0,(z)->0asf-*^)  (8) 

The  equilibrium  point  z  =  0  is  said  to  be  hyperbolic  if 
the  Jacobian  matrix  Df  of  /  at  z  =  0  has  no  eigenvalues  on  the 
;  axis.  Let  n'  denote  the  number  of  eigenvalues  of  Df  in  the 
open  left  half  complex  plane,  and  n’  the  number  in  the  open 
right  half  plane.  Stable  and  unstable  manifolds  W  and  W* 
exist  locally  in  the  neighborhood  of  hyperbolic  fixed  point  and 
have  dimensions  n'  and  n“  respectively. 

For  convenience,  we  will  use  the  following  notation.  Let 
N  =  (0.  1.  2.  •  •  •  ),  r  =  (r^,  r-j  •  ■  •  ,  r, )'  s  IN'"  and 

y  ~  ,''a(0,  '  ■  •  .  y™(<)]';  t  s  R.  Then  we  define 


3,  Inversion  of  Partially  Linearizable  Systems 

Consider  a  nonlinear  system  of  the  form  (1)  and  (2)  with 
the  same  number  m  of  inputs  and  outputs  which  we  expand  in 
the  following  form: 

m 

i  =/(*■)  +  'Egi(x)Ui 
i>l 

J-i  =  A,(x) 


y-.  =  h^ix) 

We  assume  that  the  system  has  well-defined  relative 

degree  r  =  (ri,  rj,  •  •  •  .  r„)  at  the  equilibrium  point  0.  that  is, 

(i)  for  ail  I  £  ;  S  /n,  for  all  1  2  ^  m,  for  all  i  <  r;  -  1,  and 
for  all  X  in  a  neighborhood  of  the  origin, 

L,.L}Mx)  =  0.  (11) 

(ii)  the  m  xm  matrix 

L,,L;‘‘‘a,(x)  •••  L,^L/''h,(x) 

L,^Lf^''hi{x)  L,^l;-‘'Aj(x) 

P(*)  =  ...  ...  "...  (12) 

L,Lr'\(x)  L,^Lr'\,ix) 

is  nonsinguiar  in  a  neighborhocxl  of  the  origin. 

Under  this  assumption,  the  system  can  be  partially  linear¬ 
ized.  To  do  this,  we  differentiate  y,-  until  at  least  one  u.-  appears 
explicitly.  This  will  happen  at  exactly  the  r,  th  derivative  of  y,- 
due  to  (11).  Define  qj  -  y,'*  "  **  <  =  L  ■  ■  ■  ,  n  and 
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where 


li  =p<yj,  it.  Ti) 


(23) 


.  r, .  and  denow 


s  =  (^i* 


y\<  ■ 


1  .  . 
'  SI- 


V)' 


yi 


.  >2.  •  ■  • .  yi 


('-n. 


•.V„~  )',  (13) 

Choose  T|,  an  rt  -  I  r  I  dimensional  funcdon  on  R"  such  that 
(q‘,  T)')'  =  \i/(x)  forms  a  change  of  coordinates  with  \i/(0)  =  0 
[14],  In  this  new  coordinate  system,  the  system  dynamics  of 
equadnn  (1)  becomes 


-il  -  ^2 


V;  -  1  = 


for  i  =  I,  • 

^  =  ot,  (q.  -n)  +  P,  (^-n)u 

n  =  Ti)  +  <72(s.  n)«. 

which,  in  a  more  compact  form,  is  equivalent  to 
y''*  =  a(^  n)  +  P(q.  T|)u  , 

^  ■n)  +  <72(4.  ■ 

y  ={yi.y->.  •  ■ .  v-)'. 

u  =  (u„  uj.  ■  ■  ■ ,  u^y, 

“(i  n)  =  ^)). 

|3(^  ri)  =  f  /  ;-'h(r'(^  r\)). 


jn. 


(14) 


(1^) 

(16) 


where 


(17) 

(IS) 


Here  [3  is  acnially  the  same  (5(2:)  matrix  denned  in  the  equation 
(12),  aiO,  0)  =  0  since  /(O)  =  0.  and 

h(x)=  [h,(x).  hr(x).  •  •  •  .  h„(x)l'. 

^(^c)  =  (^i(2r).  ,g2(*).  •  ■  •  .  ,?«(-')l- 

Since  by  the  reiadve  degree  assumption.  (3(>  q)  is  nonsingular, 
the  following  feedback  control  law 


U  =  [P(^  11)]*'(V  -  q)] 


(19) 


is  well  denned  and  partially  linearizes  the  system  such  that  the 
input-output  relationship  is  given  by  a  chain  of  integrators: 

y''>  =  V  (20) 

where  v  s  R"  is  the  new  control  input.  Assume  both  y  and  y^ 
start  from  rest  and  choose 


=  y/’ 


(21) 


Then  immediately  we  have 


a  ,  «■,-')  "2-i> 

=  (yj\.  yai.  •  •  ■ .  yai  .  y^r.  •  • .  yat 


(r 

■  ■  .yjm 


(22) 


and  equation  (16)  becomes,  which  we  call  the  reference  dyrwn- 
xs,  or  the  zero  dynamics  dnven  by  the  reference  output  trajee 
lory. 


P(yd.  q) 

=  ?i(^.  -n)  +  <72(^.  •n)[P(^.'n)l'‘b’J''’  -  af^.n))-  (24) 

It  is  clear  now  that  an  integration  of  the  reference  dynamics 
gives  rise  to  a  trajectory  of  the  original  state  through  the  inverse 

coordinate  transformation  x  =  'F"'(^)  and  an  input  trajectory  by 
equadon  (19).  Now  the  quesdon  is  how  to  integrate  the  refer¬ 
ence  dynamics  to  generate  a  bounded  input  solving  the  stable 
inversion  problem,  since  the  reference  dynamics  may  be 
unstable  in  both  posidve  and  negative  time  direcdons  in  general. 

For  reference  trajectories  with  compact  support,  the  refer¬ 
ence  dynamics  become  autonomous  zero  dynamics  for  r  outside 
the  compact  interval  [ro,  t/1  Assuming  that  q  =  0  is  a  hyper¬ 
bolic  equilibrium  point  of  the  autonomous  zero  dynamics,  then 
there  exist  stable  and  unstable  manifolds  VV"  and  VF*.  Locally 
W*  can  be  defined  by  an  equadon  5“(q)  =  0  and,  similarly.  V/‘ 
can  be  defined  by  5'(q)  =  0.  The  following  theorem  is  our 
main  result. 

Theorem:  The  stable  inversion  problem  has  a  soludon  if  and 
only  if  the  following  two-point  boundary  value  problem  has  a 
soludon: 

li  =piy4>  ■n)> 

subject  to 

B‘(q(f,))  =  0. 

5*(q(V))  =  0. 

Proof:  (necessity)  Suppose  x^^t)  and  «i(f)  solve  the  stable 
inversion  problem.  Then  x^iy)  and  satisfy  the  differential 
equadon  (1).  Let  (q',  q')'  =  Vfx^).  Then  q  and  q  sadsfy  (1)  or 
equivalently  (14)  with  u  subsututed  by  Besides,  since 
y  =  ft(x^)  =  ye  by  assumpdon,  and  y''^  =  yj'^-  There¬ 

fore  by  equadon  (15) 

yj'>  =  a(^,  q) -r  (3(^.  q)Ui. 

which  yields 

Ue  =  [P(^.  q)r‘(yj'’’  -  ct(q<.  T))- 

Substicuung  this  into  the  q  dynamics  of  equation  (165  and  com¬ 
paring  the  resulting  right-hand  side  with  the  definition  of 
p(ye,  rj)  in  equadon  (24),  we  recognize  that  q  sadsfy  equa¬ 
don  (23). 

Now  we  only  need  to  show  that  q  also  satisfies  the  boun¬ 
dary  condidon.  This  is  easy.  Since  by  assumption,  Xe{t)-*0  as 
t—*—oo  and  \t/(0)  =  0,  thus  q(f)— ^  as  f— v-oo  also.  Bv 
definition  of  the  unstable  manifold.  q(f)  s  IV"  for  all  f  <  to- 
Therefore.  fl“(q(to))  =  0-  Similar  arguments  show  that 
3'(q(t/))  =  0. 

(sufficiency)  Suppose  q^  solves  the  above  two  point 
boundary  value  problem.  Then.  q.<  is  bounded,  and  qjit)— •'  as 


(23) 

(25) 
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t— >00  since  TijCr^)  e  IV”  and  T\j(f)— *0  as  t -*-oo  since 
■nj(fo))  €  Also.  =  0  for  f  S  /o  for  r  ^ 

Let 

<4,  =  [P(;j.Ti^)r(>r-ot(^.Ti,)). 

Then,  and  are  bounded,  and  x^(f).  u^Cr)— *0  as  *oo  or 

.-oo.  since  v'iO.  0)  =  0  and  a(0.  0)  =  0.  And  by  ihe 
definition  of  q,  y  =  y^. 

This  completes  the  proof. 

A  geometric  interpretation  of  the  Jt^fr)  evolution  is  shown 
in  Figure  1.  The  noncausai  part  of  the  nominal  control  drives 
the  internal  states  of  the  system  along  the  unstable  manifold  of 
the  zero  dynamics  manifold  to  a  particular  initial  condition 
x^(to)  while  maintaining  zero  system  output.  This  initial  condi¬ 
tion  guarantees  two  things:  1)  the  desired  reference  output  tra¬ 
jectory  is  easily  reproduced  with  bounded  input  and  states;  2) 
the  internal  states  land  on  the  stable  manifold  of  the  zero 
dynamics  manifold  at  the  end  of  luput  tracking.  With  this  nice 
final  condition,  the  internal  states  will  converge  to  zero  along 
the  stable  manifold  without  affecting  the  output.  This  geometri¬ 
cal  picnire  is  shown  in  Figure  1  where  for  clarity  we  showed  a 
case  of  output  slewing  so  that  the  to  and  //  zero  dynamics  mani¬ 
folds  are  separate. 

Here  we  see  that  the  stable  inversion  problem  is 
uansformed  into  a  two-point  boundary  value  problem  for  which 
the  num'oer  of  equations  is  reduced.  However,  it  is  still  a  non¬ 
trivial  numerical  problem.  The  difficulty  anses  because  of  the 
instability  of  die  reference  dynamics  in  both  positive  and  nega¬ 
tive  time.  Existing  approaches,  for  example  the  shooting 
method,  do  not  perform  well  numerically  for  unstable  systems. 

In  the  case  of  minimum-phase  s>  stems,  the  reference 
dynamics  is  asymptotically  stable  in  the  forward  time.  The  size 
of  the  stable  mamfold  is  the  same  as  dt^t  of  the  zero  dynamics 
manifold  and  the  unstable  manifold  reduces  to  the  origin  only. 
Tnerefore.  the  boundary  condidon  fl'tilffo))  =  0.  reduces  to 
rjiro)  =  0  and  5'(ti(;^))  =  0  imposes  no  extra  constraints.  And 
the  two-point  boundary  value  problem  reduces  to  a  simple  inidal 
condidon  problem  with  an  asymptodcally  stable  dynamics,  and 
can  oe  easily  integrated  in  the  forward  time.  This  is  Hirschom's 
appioach.  Similarly,  if  the  zero  dynamics  is  completely 
unstable,  the  two-point  boundary  value  problem  reduces  to  a 
final-value  problem  and  can  be  easily  integrated  in  backward 
tune. 

Another  simple  situadon  is  when  the  stable  and  unstable 
part  of  the  reference  dynamics  can  be  decoupled  by  change  of 
coordinates.  This  happens  when  the  reference  dynamics  is  a 
linear  time-invariant  system  driven  by  the  reference  output  and 
its  derivadves.  In  such  cases,  we  can  easily  integrate  the  stable 
part  in  forward  time  and  the  unstable  part  in  backward  time. 


4.  .An  Examole 

In  this  section,  both  the  inversion  and  regulator 
approaches  will  be  applied  to  a  simple  nonlinear  nonminimum- 
phase  system.  The  example  system  is  selected  such  that  both 
the  integradon  of  the  reference  dynamics  and  the  solution  to  ihe 
noniinear  pardal  differendal  equadons  are  manageable.  The  per¬ 
formance  of  the  two  approaches  is  compared. 

.Now  consider  a  slightly  nonlinear  single-input  single- 
output  system  described  by  the  following  equadons: 


* 

.  - 

- 

Xl 

-Xl  -I-  X2 

0 

Xz 

-3xi  -1-  xj* 

2  -t-  sin^4 

X} 

— 

X,  -  2X3 

+ 

0 

Xt 

_  -x*  -hXf 

0 

y  =xi-  3x,. 

The  reference  output  ffajectory  is  given  by: 

f  2(l-cos(t))  f  6  [0.  2n]. 

0  otherwise, 

and  is  depicted  in  Figure  2  with  a  dotted  curve! 

First,  let  us  consider  the  regulator  approach.  The  refer¬ 
ence  signal  can  be  exactly  generated  by  the  following  linear 
time-invariant  exo'ystem; 

W,  =  Wj. 

Wj  =  “W),' 

Wj  =  0, 

yi  =  wj  -  w,. 

The  initial  conditions  are  set  and  reset  as  follows: 

»[(— oo)  =  Wl(— oo)  =  Wjf— oo)  =  0; 

W)(0)  =  >1^3(0)  =  2,  w;(0)  =  0; 

'v,(2jt)  =  wj(2n)  =  W3(2n)  =  0. 

The  zero  erro'  manifold,  x  =  x(w)  and  u=a(w),  is  obtained  by 
solving  a  system  of  noniinear  pardal  differential  equations, 
which  IS  in  general  extremely  difficult  if  not  impossible.  For 
this  example,  the  partial  differential  equadons  are  as  follows: 

vX;(w)  9x;(w) 

— n - w, - :: - w,  =  -r,(w)  ->-  X;(w). 

t-W[  =  -3x2(w)  -t-  .rj’fw)  +  (2  -t-  sin^4(w  ))u(w). 

dw.  dv/i 


qj^(w) 

ow, 


wj  ■ 


) 


wi  =  X,(w)  -  2l3(w). 


dal(w)  dxi(w) 

- w, - H - w,  =  -x.fw)  -t-  X3-(lv), 

dwi 


1/^ 


aw  1 


sub'ect  to 


Xi(w)  -  3X3(w)  =  Wj  -  w, 


794 


Forainacely,  we  are  able  lo  get  a  closed-form  solution  as  fol¬ 
lows: 

/  X  5  3  . 

=  -  Y'*'i  - 

x^w)  =  -  7w,  -  4w4  -  2^3, 

I3(jv)  =  — ^(Wi-l-Wj)  -  Wj. 

xjw)  =  -^—W?  -^W-?  -t-  — —  W1W3  -t-  W3*, 

20  ‘  20  "  10 

u(n( )  =  (  -  l7wi  -  Iwi  +  I8W3)  /  (2  sinSttfw)). 

Now  note  that  with  u  =  0,  the  forward  system  is  locally  asymp¬ 
totically  stable  since  the  Jacobian  matrix  of  /(x)  at  x  =  0  is 
cleariy 

-110  0 
0-300 
10-20 
0  0  0  -1 

and  has  ail  its  eigenvalues  negative.  Therefore,  for  simplicity, 
we  can  choose  the  feedback  gam  to  be  zero. 

This  completes  the  regulator  design.  The  simulation 
results  are  shown  in  Figure  2  to  4.  Figure  2  compares  the 
desired  and  actual  output  trajectory.  Figure  3  shows  the  state  tra¬ 
jectories  solving  the  partial  differential  equations,  and  Figure  4 
the  actual  state  trajectories.  Note  that  the  output  generated  by 
the  regulator  does  asymptodcaily  crack  the  reference  trajectory 
as  predicted  by  theory.  This  is  evidenced  by  the  segments  from 
;  4  to  f  =  2Jt  and  t  >  9.  However,  there  are  substantial  tran¬ 

sient  tracking  error  both  when  getting  onto  the  zero  error  mani¬ 
fold  and  getting  off  the  manifold.  This  phenomenon  is  not  a 
special  case  of  this  example,  but  rather  generic. 

Next,  let  us  consider  the  stable  inversion  approach.  To 
partially  linearize  the  system,  we  differentiate  the  output  y  to 
yield 

y  =  Xi  -  3x3  =  -4x,  +  Xj  6x3. 

Since  the  conaol  u  does  not  appear  explicitly,  we  differentiate  y 
again  to  yield 

y  =  -4(-xi  -t-  X;J  -  3x3  -t-  xi*  w  (2  +  sin^4)u  +  6(xi  -  2x3) 

=  :  10x|  -  7x2  -  12x3  +  x’ )  +  (2  -t-  sin^t4)«  =  ct(x)  P(x)u. 

Now  not  only  does  u  appear,  its  coefficient  p(x)  ^  0  for  all  x. 
Hence,  wc  can  set 

p(j:) 

and  introduce  a  change  of  coordinates: 


The  inverse  transformation  is  given  by: 


Using  the  feedback  of  equation  (26),  the  system  in  the  new 
coordinates  becomes: 

y=yi.  (2S) 

Til  =111  +  y. 

TIZ  = -iiz  +  Tlf- 

Since  we  assume  partially  (the  output  pan)  correct  initial  condi¬ 
tions,  equation  (28)  leads  to 

y  =yd- 

Then  the  above  q  dynamics  with  y  substituted  with  y^  charac¬ 
terize  our  reference  dynamics.  And  for  r  outside  [to,  Cf], 
yd  =  0,  giving  the  zero  dynamics: 

1)1  =1)1. 

1)2  =  "1)2  -r  1)l'- 

This  is  an  autonomous  system.  It  is  clearly  hyperbolic,  since  its 
nrst  order  approximation  has  eigenvalues  1  and  -1.  Therefore 
there  exist  stable  and  unstable  manifolds.  The  stable  manifold 
can  be  easily  seen  to  be  characterized  by 

1),  =  0. 

And  the  unstable  manifold  is  characterized  by 


Therefore  the  two-point  boundary  value  problem  is  given  by: 


ni  =  T)i+y.i.  i)t(^/)  =  o. 

,  ,  Fi^fo) 
1)2  =  -1)2  +  1)1.  1)2(fo)  =  — - ■ 


This  paniculir  example  is  in  a  triangular  form  and  can  be  easily 
solved.  For  the  first  equation  is  antistable  with  a  final  value 
condition,  it  can  be  easily  integrated  backward  in  time  to  give 
qi.  The  integration  is  continued  into  the  time  f  <  tg  and 
stopped  when  Iqjl  is  sufficiently  small.  Once  we  have  qi.  ’Jic 
second  equation  is  a  stable  system  with  an  initial  condition  and 
driven  by  a  known  input  qj^.  Integration  forward  in  time  is  no 
problem  either.  For  die  part  of  qa  before  fg,  we  use  the  simple 
algebraic  .'elation  of  equation  (29)  since  the  trajectory  remams 
on  the  unstable  manifold. 


Once  qt  and  qa  are  calculated,  the  desired  trajectory  of 
the  original  states  can  be  obtained  using  the  inverse  coordinate 
trar,sforTnation  in  equation  (27)  with  y  =  y^  and  y  =yj.  Then 
the  nominal  control  input  is  calculated  according  to  the  lineariz¬ 
ing  feedback  law  in  equation  (26).  Note  that  the  q's  are 
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nonzero  for  t  <  !q,  the  u  thus  obtained  is  also  nonzero  for 
;  <  .'o.  corresponding  to  a  noncausal  mput. 

Again,  since  the  forward  system  is  asymptotically  stable, 
we  choose  zero  feedback  gain  for  simplicity.  The  simulation 
results  are  shown  in  Figures  5  to  7.  Figure  5  compares  the 
desired  and  actual  output  trajectories.  Figure  6  shows  the  nomi¬ 
nal  state  trajectories  by  inversion,  and  Figure  7  the  actual  state 
trajectories  generated  by  the  truncated  nominal  input.  Note  that 
an  almost  perfect  output  cracking  is  obtained  using  a  mild  con¬ 
trol  efforL  .A  stabilizing  feedback  loop  is  e.xpected  to  --nprove 
t.hose  small  errors. 

5.  Conclusion 

The  primary  contribution  of  this  chapter  is  the  connection 
made  between  zero  dynamics  and  stable  inversion  of  nonlinear 
systems.  These  results,  in  conjunction  with  Hirschom's  show 
that  there  are  multiple  inverses  for  noruninimum-phase  nonlinear 
systems  -  bounded,  noncausal  solutions  produced  with  our 
method  and  unbounded,  causal  solution  produced  using 
Hirschom’s  technique.  These  inversion  techniques  are  funda¬ 
mental  to  nonlinear  tracking  controllers  which  use  feed-forward 
in  conjunction  with  stabilizing  feedback.  Future  work  wiil 
include  conditions  for  existence  of  solutions  to  the  two-point 
'ooundary-valuc  problems  and  global  behavior  of  inverses. 
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Figure  2.  Desired  and  acmal  output 
in  inversion  approach 

Figure  3.  Desired  state  trajectory  in 
inversion  approach 

Figure  4.  Actual  state  trajectory  in 
inversion  approach 


Figure  5.  Desired  and  actual  output  in 
regulator  approach 

Figure  6.  Desired  state  trajectory  in  re¬ 
gulator  approach 

Figure  7.  Actual  state  trajectory  in  re¬ 
gulator  approach 
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Abstract 

This  paper  addresses  (be  inversion  of  nonlinear  systems  of  the  form 
X  =/  (x)  *  g(x)u;  y  =h(x)  from  the  perspective  of  nonlinear  geometric 
control  theory.  We  use  the  notion  of  zero  dynamics  for  obuining  stable, 
though  noncausal,  inverses  for  nonminimum  phase  systems.  This  contrasts 
wtth  the  causal  inverses  proposed  by  Hirschom  where  unstable  zero  dynam¬ 
ics  result  in  unbounded  solutions  to  the  inverse  problem.  We  show  that 
hyperbolicity  of  the  zero  dynamics  equilibrium  guarantees  invertibility. 

1.  Introduction 

Inversion  of  nonlinear  nonminimum-phase  systems  is  a  challenging 
problem  encountered  m  the  control  of  articulated  flexible  space  strucmres. 
flexible  manipulators  and  elsewhere.  A  nominal  input,  derived  by  inversion, 
which  produces  a  desired  output  is  an  excellent  feedforward  control  which 
can  then  be  stabilized  with  feedback.  Motivated  by  i.  v  results  in  the  control 
of  articulated  flexible  structures  [1]  we  explore  the  invenion  approach  to 
nonlinear  systems  with  unstable  zero  dynamics. 

An  alternative  approach  is  to  apply  the  nonlinear  regulator  theory  and  solve 
the  associated  partial  differential  equations.  For  the  linear  multivanable 
case,  the  asymptotic  tracking  problem  was  solved  by  [2|  and  subsequently 
crystalized  the  internal  model  principle.  The  matrix  equations  defining  a 
asymptotic  tracking  controller  for  linear  systems  are  easily  solvable  but 
translate  to  nonlinear  partial  differential  equations  in  the  nonlinear  case 
which  pose  severe  computational  problems  (see  Isidori  and  Byrnes  13]  ). 
This  was  avoided  in  flexible  robot  control  by  tracking  periodic  trajectories 
where  Founer  series  methods  have  been  applied  [•»].  The  class  of  periodic 
trajectories  is  poorly  suited  to  such  systems  however. 

Transient  behavior  is  an  issue  in  tracking  control  in  contrast  to  inversion 
approaches  where  deadbeat  control  can  be  acheived.  For  linear  multivan¬ 


able  systems  the  inversion  problem  has  been  resolved  to  a  large  degree  by 
Mesarovic  and  Brocken  [S]  and  Silverman  [6].  However,  these  Inverses  are 
all  causal.  Conditions  for  the  invertibility  of  nonlinear  real-analytic  systems 
have  been  derived  by  Hirshom  [7].  Here  again,  only  causal  inversion  is 
addressed.  Deeply  rooted  in  these  inversion  results  is  the  notion  of  relative 
degree  and  is  important  in  our  work  also  (although  a  clear  exposition  of  rela¬ 
tive  degree  for  nonlinear  systems  is  fairly  recentlSI ). 

In  this  paper  we  prove  a  connection  between  the  hypeibolicity  of  zero 
dynamics  and  the  invertibility  of  nonlinear  nonminimum  phase  systems.  For 
systems  with  unstable  zero  dynamics  these  inverses  are  necessanly  non¬ 
causal. 

The  remainder  of  the  paper  is  organized  as  follows.  In  section  2  we  review 
some  concepts  from  nonlinear  geometric  control  theory  and  formulate  our 
inversion  problem.  Section  3  contains  the  mam  result  and  shows  thar  inier- 
tibilty  can  be  tested  by  examining  the  zero-dynamics  of  the  system  in  ques¬ 
tion.  Our  conclusions  are  made  in  section  5.  The  appendix  contains  a  key 
technical  result  on  the  Frechet  differentiability  of  solutions  of  ordinary  dif¬ 
ferential  equations  with  respect  to  a  conuol. 

2.  Framework  and  Problem  Statement 

We  consider  a  nonlinear  system  of  the  form 

x=f(x)  +  g{x)u  <2121 

y=h(x),  <2Ab) 

defined  on  a  neighborhood  X  of  the  origm  of  /?".  with  input  u  €  R"'  and 
output  y  e  .  f  (x).  g,{x)  (the  i th  column  of  g (x  »  i  =1 .  2,  .m  are 
smooth  vector  fields  and  A,(x)  for  i  =  l,  2,  ■  ■  .p  are  smooth  functions  on 
X.  with /(0)=0  and  h(0)=O. 

In  the  context  of  the  above  system  pose  the  following 
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Stable  Inversion  Problem:  Given  a  smooth  "desired"  output  trajectory 
y^(r)  with  compact  support  (0,7],  find  a  corresponding  control  input  u^(/) 
and  a  state  trajectory  jrj(r)  such  that 

1 )  Uis  and  Xis  satisfy  the  differential  equation 

id(‘)  =  f  * g(Xi(t))u^(t)  (2.2) 

2)  exaa  output  tracking  is  achieved: 

fi(jc^(r))  =  yj(r).  (2.3) 

3)  Ud  and  are  bounded  and 

Ud(t)->0,  .t^{/)-*0  as  r-»±cx>.  (2.4) 

We  call  Xd  the  desired  state  trajectory  and  the  desired  control 
corresponding  to  the  desired  output  trajectory.  These  can  be  incorporated 
into  a  dead-beat  controller  by  using  as  a  feedforward  signal  and  x  -  xj  is 
an  error  signal  for  feedback.  The  design  of  the  feedback  compensator  has  no 
general  solution,  but  controllers  for  specific  systems  have  been  developed 
(see  e  g.  Paden  et  al.  19)  ). 


In  solving  for  the  trajectories  Xd  and  the  concepts  of  stable  and 
unstable  manifolds  of  an  equilibrium  point  anse  naturally.  For  the  sake  of 
completeness  we  review  the  definitions  here.  Let  r=0  be  an  equilibrium 
point  of  an  autonomous  system  defined  in  a  open  neighborhood  1/  of  the  ori- 
gm  of  /?": 

i=/(r).  (2.5) 

and  iti|(r)  be  the  solution  passing  through  :  at  t=0.  We  define  the  (local) 
stable  and  unstable  manifolds  W'.  W*  as  follows: 

WJ=(r  e  U  I  di(r)-*0ast-*oo,  and0,(r)e  U  V  t  iO) 

W“=lz  e  U  I  i5,(r  )-»0  as  and  d,(r  )e  U  ^  !  iO). 


The  equilibrium  point  r=0  is  said  to  be  hyperbolic  if  the  Jacobian 
main*  Df  of  f  at  r  =0  has  no  eigenvalues  on  the  j (i)  axis.  Let  n *  denote  the 
number  of  eigenvalues  of  Df  in  the  open  left  half  complex  plane,  and  n" 


the  number  in  the  open  right  half  plane.  In  this  case,  stable  and  unstable 
manifolds  W’  and  W“  exist  locally  and  have  dunensions  n’  ■■  n“  respec¬ 
tively. 


For  convenience,  we  will  use  the  following  notation:  /V=0.  1.  2.  ■  . 
r=(r|.r2.  •  •  ■  .r„f  and  y  =(y  i(i).  y  lU ).  ■  •  ■.>«(<))'';  t  e  /?.  Also 


define 


d''yi 

dt'' 

d''yi 

dt'' 

d''y„ 

dt'- 


(2.6) 


We  will  use  the  bold  number  I  to  denote  the  vector  (1.  1.  .  1)'"  so  that 


IT’ 


If  y:R''-*R’”  and  [.R'-vR',  we  define  Lfy=l^f(x), 


Lfy  -  Lf  (Lf~'y )  and 


i-fy 


Lf'yx 

Lfbi 

L'rym 


(2.7) 


3.  Inversion  of  Partially  Linearizable  Systems 

Consider  a  nonlinear  system  of  the  form  t2.1)  with  the  same  number  m 
of  inputs  and  outputs  which  we  expand  in  the  following  forni: 

x=f(x)*^g,lx)Ui  (3  1a) 

yi=A|(-t) 


y«=fim(Jt)  (3.1b) 

We  assume  that  the  system  has  weU-defined  relative  degree 
r  =(r  I .  r  2.  •  • .  )  at  the  equilibrium  point  0.  that  is. 

(i)  for  all  1  S  y  S  m .  for  all  1  S  i  S  tn .  for  all  i  <  r,  -  1.  and  for  all  x  in  a 
neighborhood  of  the  origin, 

LgLfh,(x)=0.  (3.2) 

(ii)  the  mxm  matrix 


'/I  ,(x)  • 

Z.„L/>-'fi2(4)  • 

^-*.^"■''>2(4) 

L,,L}--<h„(x)  ■ 

is  nonsingular  in  a  neighboihood  of  the  origin. 

Under  this  assumption,  the  system  can  be  partially  linearized.  To  do 
this,  we  differentiate  y,-  until  at  least  one  uj  appear  algebraically.  This  will 
happen  at  exactly  the  r.ih  derivative  of  y, .  Define  ql=y/*'''  for 
1=1,  ■  ■  .m  andA:  =  l.  •  ■  .r,,  anddenote 

-  (3.4) 

=  (y,.yi.  •■•.yf''-''.y:-  . .yr'-'V. 

Choose  q(x).  an  n-lrl  dimensional  function  on  R"  such  that 
q*’)*’  =  y(x )  forms  a  change  of  coordinates  with  1 )  y(0)  =  0  and  2)  the 
system  dynamics  (3.1)  become  [8] 


= 

■  for  t  =  1.  •  ,m,  (3  5a) 

=a.(5.’l)-P,(5.n)u 

’i=<7i(5.n)  +  92(5.n)“.  (3.5b) 

which,  in  a  more  compact  form,  is  equivalent  to 

,  y(''>  =  a(5,q)  +  (3(^,q)u 

n  =  <7i(^.  n) +<?2(t  n)“  . 
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where 


Proof:  Let 


>  =C)'l.>2.  ■ 

u  =(ui.ui,  ■  ■  ■  .u„f. 

a(5,ti)  =  L/A('r'(t  n)). 

This  P  is  actually  the  same  P(x )  matrix  defined  in  the  above,  and 
fi(.t )  =  [fi  i(x), 
j (x)  =  [j  i(x),  «2U).  ■  ■  • 

a,  and  P,  are  the  i  th  row  of  a  and  P  respectively.  By  the  relative  degree 
assumption.  P(^.  r^)  is  nonsingular  so  we  can  define  the  feedback  control  law 
u  :=  p-'(5.  n)(v  -  a(^.  n)],  (3.7) 

This  control  linearizes  the  input-output  behavior  to  a  chain  of  integrators: 

y('>  =  v  (3.8) 

where  v  e  /?"■  is  the  new  control  input.  Assume  both  y  (r )  a  0  and  y^U)  a  0 
for  r  <0  and  choose 

v=yi'>  (3.9) 

Then  immediately  we  have 


4  =  ^:= 

(y,n.yd\.  ■  . . yi;-'") (3. lO) 

and  equation  (3.6)  becomes. 

n=p(^.yi".n)  (3.11) 

where 

p(yd’^d-'n)  :=<7i(4u.  n)  - n)p-'(v-'(ii.'i))(yi'’  - a(i(r'(^.n))l- 

For  brevity  we  define  ^  .  yi'  ’)  and  write  (3. 1 1 )  as 

n=F(Cr.n)-  (3.ir) 

We  call  (3.1 1')  the  ^  dynamics.  Equation  (3.1 1)  together  with  (3.8)  deter¬ 
mine  the  linearizing  input  through  (3.7).  Our  goal  is  to  choose  a  particular 
solution  of  (3.11’)  such  that  the  resulting  input  tends  to  zero  at  ±oo  and 
meets  the  other  requirements  of  being  and  inverse  to  y^ . 

For  reference  trajectories  with  compact  support  [0.  T].  the  reference 
dynamics  become  autonomous  zero  dynamics  for  r  outside  the  interval 
[0.  T 1.  The  following  theorem  is  our  mam  result. 

Theorem  (State-space  conditions  for  existence  of  inverse):  Consider  the 
system 

i=f{x)*g(x)u  (3.13a) 

y=h(x)  (3.13b) 

where  /  ,  g  and  h  are  smooth.  Let  0  be  an  equilibrium  point  of  x  =  /  (x )  and 
assume  without  loss  of  generality  that  h(0)  =0.  If  yj(r)  is  a  smooth  desired 
output  with  compact  support  [0.  T)  and  0  is  a  hyperbolic  equilibrium  point 
of  the  zero  dynamics  then  there  is  a  solution  to  the  inverse  problem  provided 
•  ,  «  HFri*  2  is  not  too  large. 


y<'^  =  yi'>  (3.1-ta) 

(3. 14b) 

be  the  normal  form  of  the  system  with  initial  conditions  0  at  -oo  and  v 
chosen  to  cause  tracking  of  the  desired  output.  We  need  to  find  a  solution  to 
the  r|  dynamics  such  that  the  input  u  tends  to  zero  at  too.  Define  be  the 
flow  of  the  T)  dynamics  (3.14b)  from  time  r  to  time  T.  Note  that  there  exists 
an  interval,  say.  [O.T]  outside  which  y  and  its  derivatives  are  zero.  Hence 
the  T)  dynamics  before  0  and  after  T  are  sunply  the  zero  dynamics.  Since 
the  stable  and  unstable  manifolds  of  the  equilibrium  point  0  are  indeed 
smooth  manifolds  there  exists  smooth  functions 

the  stable  and  unstable  manifolds. 


We  claim  that  OiSj-lW'irvlV'  is  nonempty  (an  element  of  this  set  defines  an 
initial  condition  for  selecting  our  solution  to  (3.14b)).  That  is.  the  image  of 
the  unstable  manifold  of  0  under  the  flow  intersects  (he  stable  manifold 
of  0.  This  is  equivalent  to  the  existence  of  a  solution  to 


/  (n.^) 


(6,^(1))) 

fl*(T1) 


(3.16) 


For  ^  =  0  there  is  a  solution  since  the  stable  and  unstable  manifolds  intersect 
at  0.  We  now  employ  the  implicit  function  theorem  to  show  existence  of 
solution  for  I  fl  is  not  too  large.  Since  the  intersection  of  the  stable  and 


unstable  manifolds  intersect  transversally  at  the  equilibnum  0  we  have  that 
Dr/ (.0.0)  is  an  isomorphism.  Also. 


D^(0.0)  = 


(3.171 


The  function  B“  is  smooth  and.  by  the  corollary  to  the  proposition  in  the 


Appendix.  is  continuous.  Hence,  by  the  implicit  function  theorem, 

there  exists  a  solution  Hr  'o  3  "'hen  I  ^1  ^  is  not  too  large.  By  flowuig 
this  solution  point  forward  and  backward  in  tune  we  can  obtain  the  inverse 


Define  the  desired  rj  trajectory  by 

tlu(f)  =  6r^(nr)  (.3  18) 

From  this  and  the  desired  i,  trajectory  we  construct  the  input: 
u(r)  =  -  a(^.Tij)).  Since  tjt-  is  on  the  stable  manifold  and 

the  ^  dynamics  are  the  zero  dynamics  after  time  T.  we  have  that  q-*C)  as 
r  -»oo  .  Also,  ^{r )— *0  as  r  ->oo  because  y,/  has  compact  support.  If  follows 
that  both  u  and  x  tend  to  zero  as  r-»oo.  A  similar  argument  applies  in  back¬ 
ward  time.  On  the  interval  (O.T]  solutions  are  bounded  since  they  depend 
continuously  on  ^  and  •  ^1  „  is  bounded.  Hence  and  inverse  exists. 


In  the  case  of  minimum  phase  systems,  the  reference  dynamics  is 
asymptotically  stable  in  forward  time.  The  stable  manifold  is  the  dimension 
as  the  zero  dynamics  manifold  and  the  unstable  manifold  reduces  to  the  on- 
gui  only.  Therefore,  the  boundary  condition  B“(ri)=0.  reduces  to  q=0  and 
d’(T|)=0  imposes  no  extra  constraints.  The  inverse  can  be  easily  integrated 
in  the  forward  time.  This  is  Hirshom's  approach. 

Similarly,  if  we  consider  the  system  as  evolving  backward  in  time,  and 
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this  reversed  system  has  only  the  origin  as  the  stable  manifold  of  its  zero 
dynamics  then  the  two  point  boundary  value  problem  reduces  to  a  final 
value  problem  and  can  be  easily  uitegrated  in  backward  time. 

Another  simple  situation  is  when  the  stable  and  unstable  part  of  the 
reference  dynamics  can  be  decoupled  by  change  of  coordinates.  This  hap¬ 
pens  when  the  reference  dynamics  is  a  linear  time-invariant  system  driven 
by  the  reference  output  and  its  derivatives.  In  such  cases,  we  can  easily 
integrate  the  stable  part  in  forward  time  and  the  unstable  part  in  backward 
time. 

4.  Conclusion 

The  primary  contribution  of  this  paper  is  the  proof  existence  of  inverses 
when  zero  dynamics  are  hyperbolic.  These  results,  in  conjunction  with 
Hirschom's  show  that  there  are  multiple  inverses  for  nonminimum  phase 
nonlinear  system  -  bounded,  noncausal  solutions  produced  with  our  method 
and  unbounded,  and  causal  solutions  produced  using  Hirschom's  method. 
These  inversion  techniques  are  fundamental  to  nonlinear  tracking  controllers 
which  use  feedforward  in  conjunction  with  stabilizing  feedback.  Funire 
work  will  include  input-output  characterizations  for  existence  of  solutions  to 
the  inverse  problem. 

Appendix 

Proposition:  Consider  the  system 

X  =f(x,u)\  x{0)  =  xo  (A.l) 

on  the  interval  (0.  T]  where  f  R"  xf?"  -*R''  is  smooth  in  z  and  rr  and 
lu  I  ^  ^sug^|lu(r  )l  2  <  M;  r  is  chosen  such  that  solutions  exist  on  [O.TI 

for  all  u  e  satisfying  the  bound. 

Let  denote  the  solution  of  the  differential  equation  on  [O.T].  The 

Frechet  derivative.  Oud„(r  jio):  f.„10.ri->L^(0,r|  of  the  map 

u(  )i-*i|)„(  jto)  (A.2) 

IS  given  by 


The  one-sided  derivative  of  £  is 

=  ni*^E(t),/(ltl„,(rj[o)'“|(')) .  (A. 7) 

where  mjz.yl  is  the  directional  derivative  of  the  2-norm: 
in  «.lz  .y  1  =  ~  ^  smoothness  of  /  implies  its  local 

Lipschitz  continuity  (in  both  arguments).  Since  m+lz.y  )  <lyl  we  have 
•/(i>ui(tJto)'“l('))-/(i>aj(r-xo)-“2(' ))' 

S  Af(e(r)-^liii(r)-u2(OI  !  (A  8) 

for  some  real  K ,  This  implies 

e*  S J-  (A9) 

By  solvmg  this  differential  inequality  we  obtain 

e(r)  S  (lu  \  -  U2i  (A.IO) 

Therefore  we  have  the  desired  Lipschitz  property  with  Lipschitz  constant 

Proof  of  Proposition  (see  [10]  proposition  5.3  for  related  proof  techniques). 
Let  e  >  0.  By  the  Lipschitz  continuity  of  ♦aff  jco)  proven  in  the  lemma  and 
the  smoothness  of  / .  there  exists  r  e  ( 0,  (M  -  •  ^1  „,)  1  such  tliat 

II  /  -'o)-"  +^)  -  /(♦u  ('  0  ^o).u  -to) 

'^»(fJ:o)|-|^■«'y('.tO).a,;(i)ll  2S£I;I„  (A.ll) 

for  all  r  e  (O.TI  and  5  satisfying  1 1^1  ,„  <  Af  -  lu  I  Define  the  error 
between  a  solution  $u(t  ^o)  and  a  perturbed  solution  aq)  by 

5(r)  =  «„♦<;(»  .to) -by  (Mo)-  (A.12) 

(The  idea  of  the  proof  is  to  bound  S(t)-  A(r)  where  A(f )  is  the  right-hand 
side  of  equation  (A. 3).  We  integrate  this  bound  to  get  a  bound  on  5(t )  -  A(r ) 
and.  finally,  we  show  that  6(r )  =  A(r )  in  the  limit  •  0.) 


Oy  by  (f  .r  O)  5  =  ["ff'  -t)  [ba  (XjCglu  (T)i;(T)d  t 


(A.3) 


where  <&(r.r)  is  the  state  transition  matrix  for 
[.^(«a(fJ:o)-«('))  Z. 


(A.4) 


Before  proceeding  with  the  proof  we  establish  the  followmg 

Lemma:  The  mapping  from  L^IO.T  |-.L,,[0,ri  given  by 

u()->by(j:o)  (A.5) 

is  Lipshitz  in  u. 


Differentiating  5(r)  yields 


Ut)=f  (b»*t(t  .zo).  u  (t )  +  C(t ))  -  /  (by  (r  jco),ii  (r )) 

but  this  is  just  the  first  two  terms  in  ( A.  1 1 ).  Hence 

•  «« )  -  -^(ba  (I  Jto))5  -  l^-by  U  jcoKI  S  El  ;i  _. 


(A.13) 

(A.14) 


Define  p(r)  =  I  6(r)  -  A(r  )l  .  The  one-sided  derivative  of  pd)  is  given  by 
P»(r)  =  m48(r)-A(r),5(r)-A(r)l 


Expanding  A  in  the  second  argument,  using  A.14  and  the  propeny 
mjz.y+z]  Sm+(z.y)  +  lzl  we  have 


p4»)  S  m. 


6(r )  -  A(r  ).|f  (by  (iJo).u(i  ))(5(r )  -  A(r ) 


+  »  (A. 15) 


i  ^1,. 


(A.16) 


Proof:  Define 

E(r)  =  li>a,(rzo)-by,(rjroJl2.  (A.6) 


where  8  is  a  umform  bound  on  ■^(tu(ljco),u(i))  for  r  e  [O.TI  and  u  satis¬ 
fying  lul  „  <  Af .  The  existence  of  the  bound  follows  from  basic  existence 
and  uniqueness  theory  for  ordinary  differential  equations,  boundedness  of  u 
and  smoothness  of  / . 
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